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DIPEPTIDYLPEPTIDASES AND METHODS OF USE 

RELATED APPLICATIONS 

This application claims the benefit of the U.S. Provisional Application 
No. 60/246,827, filed November 8, 2000, which is incorporated by reference in 
its entirety. 

GOVERNMENT FUNDING 
The present invention was made with government support under Grant 
No. DE 09761, awarded by the National Institutes of Health. The Government 
may have certain rights in this invention. 



BACKGROUND 

Porphyromonas gingivalis (P. gingivalis), an oral anaerobic bacterium, 
has been implicated as a causative agent of adult type periodontitis. As an 
asaccharolytic organism, P. gingivalis is totally dependent on external sources of 

20 peptides that are necessary for its growth and proliferation. In order to fulfill 
such a fastidious nutritional requirement this bacterium evolved a complex 
system of proteolytic enzymes which are now recognized as important virulence 
factors in the development of periodontal disease (Travis et al., J. Adv. Exp. 
Med. Biol, 477:455-65 (2000)). The best known and well characterized 

25 enzymes of this system are gingipains R and K, arginine and lysine specific, 
cysteine proteinases (Curtis et al., J. Periodontal Res., 34:464-72 (1999)). 
Working in concert with the proteinases periodontain (Nelson et al., J. Biol. 
Chem., 274:12245-51 (1999)), collagenases/gelatinases (Birkedal-Hansen et al., 
J. Periodontal Res., 23:258-64 (1988); Lawson et al., Infect Immun., 60:1524-29 

30 (1992); Kato et al., /. Bacteriol, 174:3889-95 (1992), prtT (Otogoto et al., 
Infect. Immun., 61:117-23 (1993)), and Tpr (Bourgeau et al., Infect Immun., 
60:3186-92 (1992)) as well as host proteinases, this array of enzymes has the 
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potential to degrade proteins from both the periodontal ligamentum and 
surrounding tissues. Their concerted action leads to the formation of a large 
pool of oligopeptides, which can be further utilized by P. gingivalis and other 
oral bacteria. However, P. gingivalis cannot transport poly- and oligo- peptides 

5 into the cell, even though it has the ability to thrive on dipeptides as a sole source 
of carbon. This has led to an interest in studying a specialized group of P. 
gingivalis peptidases capable of hydrolyzing oligopeptides to di- and tripeptides, 
which can be subsequently metabolized by this periodontopathogen. The 
purification, characterization and cloning of prolyl tripeptidylpeptidase A 

10 (PtpA), an enzyme which liberates tripeptides from the N-terminal regions of 
substrates containing proline residues in the third position has been previously 
reported (Banbula et al., /. Biol Chem., 274:9246-52 (1999)). 
Dipeptidylpeptidase-IV (DPP-IV), an enzyme with similar, specificity, but only 
dipeptylpeptidase activity, has also been cloned (Kiyama et al., Biochim. 

15 Biophys. Acta, 1396:39-46 (1998)), purified, and characterized (Kumagai et al., 
Infect. Immun., 68:716-24 (2000); Banbula et al., Infect Immun., 68:1176-82 
(2000)). Together with a recently described angiotensinogen-converting enzyme 
analogue (Awano et al., FEBSLett., 460:139-44 (1999)) all of these proteases 
can hydrolyze peptide bonds containing proline residues. In addition, the P. 

20 gingivalis genome contains three further putative coding sequences encoding 
proteinases homologous with dipeptidylpeptidase-IV, although their activities 
have not yet been identified (Banbula et al., 7. Biol Chem. y 274:9246-52 (1999)). 

SUMMARY OF THE INVENTION 

25 In one aspect, the present invention provides an isolated 

dipeptidylpeptidase, active analog, active fragment, or active modification 
thereof having amidolytic activity for cleavage of a peptide bond between the 
second and third amino acids from the N-terminal end of a target polypeptide, 
wherein the target polypeptide has an aliphatic or an aromatic residue as a 

30 substituent on the a-carbon atom of the second amino acid from the N-terminal 
end of the polypeptide. Preferably, the dipeptidylpeptidase is isolated from 
Porphyromonas gingivalis. Preferably, the dipeptidylpeptidase is a serine 
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protease. Preferably, the dipeptidylpeptidase includes an amino acid sequence 
selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID 
NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ 
ID NO:23, SEQ ID NO:24, SEQ ID NO:25, and SEQ ID NO:26. Preferably the 
5 dipeptidylpeptidase is encoded by a nucleic acid including a nucleotide sequence 
SEQIDNO:l. 

In another aspect, the present invention provides an isolated polypeptide 
including an amino acid sequence having a percentage amino acid identity 
greater than about 40% with SEQ ID NO:2. 

10 In another aspect, the present invention provides an isolated nucleic acid 

including a coding sequence encoding a dipeptidylpeptidase, active analog, 
active fragment, or active modification thereof having amidolytic activity for 
cleavage of a peptide bond between the second and third amino acids from the 
N-terminal end of a target polypeptide, wherein the target polypeptide has an 

15 aliphatic or an aromatic residue as a substituent on the a-carbon atom of the 
second amino acid from the N-terminal end of the polypeptide. Preferably the 
nucleic acid includes a nucleotide sequence SEQ ID NO:l. Alternatively, the 
complement of the nucleic acid preferably hybridizes to SEQ ID NO: 1 under 
hybridization conditions of 0.5 M phosphate buffer, pH 7.2, 7% SDS, 10 mM 

20 EDTA, at 68°C, followed by three for 20 minutes washes in 2x SSC, and 0.1% 
SDS, at 65°C, wherein at least about 20 nucleotides of the complement 
hybridize. 

In another aspect, the present invention provides an isolated nucleic acid 
encoding a polypeptide, wherein the polypeptide includes an amino acid 
25 sequence having a percentage amino acid identity greater than about 40% with 
SEQIDNO:2. 

In another aspect, the present invention provides a method of identifying 
an inhibitor of a dipeptidylpeptidase, active analog, active fragment, or active 
modification thereof. The method includes identifying a compound that inhibits 
30 the amidolytic activity of the dipeptidylpeptidase by incubating the 
dipeptidylpeptidase with the compound under conditions that promote 
amidolytic activity of the dipeptidylpeptidase and determining if the amidolytic 
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activity of the dipeptidylpeptidase is inhibited relative to the amidolytic activity 

in the absence of the compound. 

In another aspect, the present invention provides a method of reducing 

growth of a bacterium including inhibiting a dipeptidylpeptidase, active analog, 
5 active fragment, or active modification thereof, by contacting the 

dipeptidylpeptidase with an inhibitor of the dipeptidylpeptidase. Preferably the 

dipeptidylpeptidase is a serine protease. 

In another aspect, the present invention provides a method for protecting 

an animal from a periodontal disease caused by Porphyromonas gingivalis 
10 including administering to the animal an inhibitor of dipeptidylpeptidase, 

wherein the disease is selected from the group consisting of gingivitis and 

periodontitis. Preferably the inhibitor is administered by a method selected from 

the group consisting of subgingival application and controlled release delivery. 
In another aspect, the present invention provides an immunogenic 
15 composition including an isolated dipeptidylpeptidase, an antigenic analog, an 

antigenic fragment, or an antigenic modification thereof having amidolytic 

activity for cleavage of a peptide bond present in a target polypeptide, the 

peptide bond being located between the second and third amino acids from the 

N-terminal end of the target polypeptide, wherein the second amino acid from 
20 the N-terminal end has an aliphatic or an aromatic residue as a substituent on the 

a-carbon atom. Preferably the dipeptidylpeptidase is a serine protease. 

Preferably the second amino acid is selected from the group consisting of 

alanine, phenylalanine, isoleucine, and leucine. The immunogenic composition 

may optionally include an adjuvant. 
25 In another aspect, the present invention provides a composition including 

an inhibitor of an isolated dipeptidylpeptidase and a pharmaceutically acceptable 

carrier. 

Definitions 

"Polypeptide" as used herein refers to a polymer of amino acids and does 
30 not refer to a specific length of a polymer of amino acids. Thus, for example, the 
terms peptide, oligopeptide, protein, and enzyme are included within the 
definition of polypeptide. This term also includes post-expression modifications 
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of the polypeptide, for example, glycosylations, acetylations, phosphorylations, 
and the like. A polypeptide can be produced by an organism, or produced using 
recombinant techniques, or chemically or enzymatically synthesized. 

"Polynucleotide" and "nucleic acid" are used herein interchangeably and 
5 refer to a linear polymeric form of nucleotides of any length, either 

ribonucleotides or deoxynucleotides, and include both double- and single- 
stranded DNA and RNA. A nucleic acid may include both coding and 
non-coding regions that can be obtained directly from a natural source (e.g., a 
microorganism), or can be prepared with the aid of recombinant or synthetic 
10 techniques. A nucleic acid may be equivalent to this nucleic acid or it can 

include, in addition, one or more other polynucleotides. For example, the nucleic 
acid of the invention can be a vector, such as an expression of a coding 
sequence. 

"Peptidase," "proteinase," and "protease" all refer to enzymes that 

15 catalyze the hydrolysis of peptide bonds in a polypeptide. A "peptide bond" or 
"amide boiid" is a coyalent bond between the alpha-amino group of one amino 
acid and the carboxyl group of another amino acid. "Peptidase inhibitor," 
"proteinase inhibitor," "protease inhibitor," and "inhibitor" all refer to 
compounds that inhibit a peptidase that catalyzes the hydrolysis of peptide bonds 

20 in a polypeptide. 

"Serine protease" refers to an enzyme that uses the hydroxy-functional 
side chain of serine as a nucleophile in a catalytic reaction. 

"Amidolytic activity" refers to the ability of a polypeptide to catalyze the 
hydrolysis of at least one peptide bond in a polypeptide. The term "cleavage" can 

25 also be used to refer to the hydrolysis of a peptide bond in a polypeptide. A 

"dipeptidylpeptidase" is able to hydrolyze the peptide bond between the second 
and third amino acids from the N-terminal end of a target polypeptide including 
the general formula H-Xaa-Yaa-Xaa-, wherein Xaa is a natural or modified 
amino acid, and Yaa is an amino acid including an aliphatic or an aromatic 

30 residue as a substituent on the a-carbon atom. Preferred amino acids in the Yaa 
position include alanine, phenylalanine, isoleucine, and leucine. 
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A "target polypeptide" is a polypeptide that is the potential substrate of 
the amidolytic activity of a dipeptidylpeptidase. A "dipeptidylpeptidase" does 
not have to cleave all members of the target polypeptide. The term "natural 
amino acid" refers to the 20 amino acids typically produced by a cell. The term 
5 "modified amino acid" refers to, for instance, acetylation, hydroxylation, 
methylation, amidation, or the attachment of carbohydrate or lipid moieties, 
cofactors, and the like. 

As used herein, the term "isolated" means that a polypeptide or a 
polynucleotide has been either removed from its natural environment, produced 
10 using recombinant techniques, or chemically or enzymatically synthesized. 
Preferably, the polypeptide or polynucleotide is purified, i.e., essentially free 
from any other polypeptides, polynucleotides, and associated cellular products or 
other impurities. 

An active analog, active fragment, or active modification of a 
15 polypeptide of the invention is one that has amidolytic activity by hydrolysis of a 
peptide bond present in the target polypeptide as described herein. Active 
analogs, active fragments, and active modifications are described in greater 
detail herein. 

An antigenic analog, antigenic fragment, or antigenic modification of a 
20 polypeptide of the invention is one that has amidolytic activity by hydrolysis of a 
peptide bond present in the target polypeptide as described herein. Antigenic 
analogs, antigenic fragments, and antigenic modifications are described in 
greater detail herein. 

"Percentage amino acid identity" refers to a comparison of the amino 
25 acids of two polypeptides as described herein. 

As used herein, "aliphatic residue" means an organic radical having 
carbon atoms linked in open chains. 

As used herein, "aromatic residue" means an organic radical that includes 
an aromatic ring (e.g., an aromatic group, an alkaryl group, or an aralkyl group). 
30 As used herein, the "PI" position of a polypeptide is the amino acid on 

the N-terminal end of the scissile bond that is being cleaved. For 
dipeptidylpeptidases that cleave the peptide bond between the second and third 
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amino acids from the N-terminal end of a target polypeptide, the PI position is 
the second amino acid from the N-terminal end of the target polypeptide (i.e., the 
penultimate position). 

5 BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 depicts a plot of the absorbance at 280 nm (A) and amidolytic 
activity against Ala-Phe-pNA (•) for the purification of P. gingvivalis 
dipeptidylpeptidase (DPP-7) from the acetone precipitate of the P. gingivalis cell 
extract. The straight solid lines indicate gradients in the eluting composition. 

10 Figure 1(a) illustrates the separation of DPP-7 on hydroxyapatite (100 ml) 
equilibrated with 20 mM potassium phosphate buffer, pH 7.0, and using a 
potassium phosphate gradient from 20 mM to 300 mM. Figure 1(b) illustrates 
the separation of DPP-7 obtained from the previous step on Phenyl-Sepharose 
HP (25 ml) equilibrated with 50 mM potassium phosphate, 1M ammonium 

15 sulfate, pH 7.0, at a flow rate of 30 ml/hour, and using an ammonium sulfate 
gradient from 0.4M to 0M. Figure 1(c) illustrates the separation of DPP-7 on a 
MonoS FPLC column using a sodium chloride gradient from 0M to 0.3M then 
from0.3M to 1M. 

Figure 2 is a depiction of the SDS-PAGE of fractions obtained during the 
20 purification of P. gingivalis DPP-7 with Lane A representing molecular mass 
markers (phosphorylase B, 97 kDa; bovine serum albumin, 68 kDa; ovalbumin, 
43 kDa; carbonic anhydrase, 30 kDa; soybean trypsin inhibitor, 20 kDa; oc- 
lactalbumin, 14 kDa); Lane B representing acetone precipitate from Triton X- 
100 extract of P. gingivalis; Lane C representing hydroxyapatite column eluate; 
25 Lane D representing Phenyl-Sepharose column eluate; and Lane E representing 
MonoS column eluate. 

Figure 3 depicts a plot of the DPP-7 activity against Ala-Phe-pNA vs. 
pH. Enzyme activity was tested on Ala-Phe-pNA substrate in different buffers 
including: HEPES (•); PIPES (□); potassium phosphate (■ ); Tris (o); and MES 
30 (A). 

Figure 4 depicts the coding sequence (SEQ ID NO:l) encoding P. 
gingivalis DPP-7 (SEQ ID NO:2). Sequences obtained from the Edman 
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degradation of the trypsin fragmented DPP-7 polypeptide chain are underlined. 
The putative active site serine residue is marked by the black background. 

Figure 5 is a listing of sequences comparing the C-terminal regions of the 
P. gingivalis DPP-7 (residues 664-695; SEQ ID NO:3) and 5. aureus V8 
5 endopeptidase (residues 704-863; SEQ ID NO:4). Common residues are 

indicated by the single letter amino acid in the line between the two sequences. 
The "+" symbol in the line between the two sequences indicates similar residues. 

Figure 6 depicts a multiple sequence alignment of P. gingivalis DPP-7 
and its putative homologues. Sequences of DPP-7 related proteinases were 

10 obtained from the conceptual translation of the following ORFs retrieved from 
unfinished and finished genomes databases (available at www.tigr.org): Sl- 
Shewanella putrefaciens gnl | TIGRJ24 1 sputre 6401 (SEQ ID NO:5); S2- 
Shewanella putrefaciens gnl | TIGR_24 1 sputre 6410 (SEQ ID NO:6); X- 
Xylellafastidiosa gb | AE004008. 1 1 (SEQ ID NO:7); PI- Porphyromonas 

15 gingivalis gnl | TIGR | P. gingivalis JCPG, con (SEQ ID NO:8); P2- P. gingivalis 
DPP-7 gnl | TIGR | P. gingivalis JCPG. con (SEQ ID NO:9). The sequences 
were subsequently aligned using the ClustalW multiple sequence alignment tool. 



20 DETAILED DESCRIPTION OF THE PREFERRED 

EMBODIMENTS 

The present invention provides isolated polypeptides, preferably isolated 
dipeptidylpeptidases, that have amidolytic activity by hydrolysis of a peptide 
bond present in a target polypeptide, wherein the bond is between the second and 

25 third amino acid from the N-terminus of the peptide. The dipeptidylpeptidase 
has amidolytic activity by hydrolysis of a peptide bond present in a target 
polypeptide including the fragment H-Xaa-Yaa-Xaa-, wherein Xaa is a natural or 
modified amino acid, Yaa is an amino acid including an aliphatic or an aromatic 
residue as a substituent on the a-carbon atom, and the peptide bond of the target 

30 polypeptide that is hydrolyzed is the bond between the second and third amino 
acids from the N-terminus of the peptide. In increasing order of preference, 
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isolated polypeptides can cleave a target polypeptide that is at least about 5 
amino acids or at least about 400 Da, at least about 10 amino acids or at least 
about 750 Da, at least about 20 amino acids or at least about 1,500 Da, or at least 
about 30 amino acids or at least about 3,000 Da. Preferably, the 
5 dipeptidylpeptidases cleave peptides including a sequence of H-Xaa-Yaa-Xaa-, 
wherein Yaa is alanine, phenylalanine, isoleucine, or leucine. More preferably, 
the dipeptidylpeptidases cleave peptides including a sequence of SEQ ID NO: 10, 
SEQ ID NO: 1 1 ; SEQ ID NO: 12; or SEQ ID NO: 1 3 as shown in Table 3. 
The polypeptides disclosed in the present application are preferably 

10 dipeptidylpeptidases. Preferably, the dipeptidylpeptidase is isolated from 
Porphyromonas gingivalis. Preferably, the dipeptidylpeptidase is a serine 
protease. Most preferably, the dipeptidylpeptidase is P. gingivdlis 
dipeptidylpeptidase-7 (DPP-7). The polypeptides can be used as a source of 
antibodies for inhibiting the amidolytic activity and thereby possibly reducing 

15 periodontitis, loss of tooth attachment and periodontal pocket formation. 

Antibodies to dipeptidylpeptidases can also be used to identify and/or isolate 
additional dipeptidylpeptidases. Knowledge of dipeptidylpeptidases can also be 
used to make inhibitors of dipeptidylpeptidases and to make immunogenic 
compositions that could be used to elicit the production of antibodies to 

20 dipeptidylpeptidases and thereby possibly reduce gingivitis, periodontitis, loss of 
tooth attachment, and/or periodontal pocket formation. 

Dipeptidylpeptidase-7, either alone or in a mixture with other 
dipeptidylpeptidases, can be used to generate a pool of dipeptides from 
polypeptides. Dipeptides may be preferably imported by cells. Thus, pools of 

25 didpeptides might be useful substrates for transport. 

Dipeptidylpeptidase-7 (DPP-7) was purified from the membrane fraction 
of Porphyromonas gingivalis. This enzyme, preferably having an apparent 
molecular mass of about 76 kDa, has specificity for polypeptides having either 
an aliphatic or an aromatic residue as a substituent on the a-carbon atom of the 

30 second amino acid from the N-terminal end of the polypeptide. Although it 
belongs to the serine class of peptidases, it does not resemble other known 
dipeptidylpeptidases. Interestingly, the amino acid sequence around the putative 
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active site serine residue shows significant homology to the C-terminal region of 
the Staphylococcus aureus V-8 endopeptidase. In P. gingivalis, DPP-7 probably 
serves nutritional functions by providing dipeptides to this assaccharolytic 
bacterium. 

5 Several studies indicate that the outer membrane of P. gingivalis contains 

a complex, proteolytic machinery which serves multiple physiological functions. 
The present application discloses the identification of a novel proteinase 
localized on the bacterial surface. 

The purified enzyme migrated as a single band of about 76 kDa on SDS- 

10 PAGE and its amino-terminal sequence was located within the primary structure 
of the translated product of the dpp-7 coding sequence. Apparently, the enzyme 
is truncated at the amino terminus (i.e., amino acid 24 of SEQ ID NO:2 is the 
first amino acid of the truncated form) due to the action of a lysine specific 
proteinase, most likely gingipain K. Taking into account that the N-terminus of 

15 DPP-7 contains membrane anchorage domains it is likely that the N-terminal 
truncation noted here occurred during the isolation procedure and may not 
represent its true membrane form. 

The dipeptidylpeptidases of the present invention are preferably serine 
proteases that are inhibited by serine protease inhibitors. The 

20 dipeptidylpeptidases of the present invention are preferably inhibited by serine 
protease inhibitors including, for example, diisopropylfluorophosphate (DFP), 4- 
(2-aminoethyl)-benzenesulfonyl fluoride hydrochloride (PEFABLOCK), and 
phenylmethanesulfonyl fluoride (PMSF). In addition, the dipeptidylpeptidases 
of the present invention are resistant to sulfhydryl group blocking reagents and 

25 chelating agents, which is also consistent with the enzyme being a serine 

protease. However, the P. gingivalis DPP-7 does not belong to any of the six 
previously described types of dipeptidylpeptidases (Barrett et aL, Handbook of 
Proteolytic Enzymes, Academic Press, London (1998)). DPP-I is a member of a 
cysteine class of peptidases and possesses a broad specificity, but has an 

30 exclusion for basic amino acid and proline residues in the PI site of the scissile 
peptide bond (McGuire et aL, Arch. Biochem. Biophys., 295:280-88 (1992)). 
DPP- VI is another representative of the cysteine proteinase family with 
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dipeptidylpeptidase activity towards broad spectrum of substrates (Vacheron et 
aL, Eur. J. Biochem., 100:189-96 (1979)). DPP-II, DPP-IV and DPP-V belong 
to the S9 family of the serine proteases (Barrett et aL, Handbook of Proteolytic 
Enzymes, Academic Press, London (1998)). Both DPP-II and DPP-IV share 
5 similar specificity directed against Pro and Ala residues in the penultimate 

position whereas DPP-V is an enzyme secreted by Aspergillus fumigatus with a 
unique substrate specificity limited to X-Ala, His-Ser, and Ser-Tyr dipeptides 
(Beauvais et aL, /. Biol Chem., 272:6238-44 (1997)). DPP-HI is also classified 
as a serine peptidase, with its action being restricted to Arg residue in the PI 

10 position (Ellis et aL, 7. Biol Chem., 242:4623-29 (1967)). In terms of 

biochemical features, DPP-7 resembles a dipeptidyl aminopeptidase (DAP-BE), 
which was isolated from Pseudomonas sp. strain W024, but the coding 
sequence of that enzyme remains unknown and does not allow a sequence 
comparison of these proteins (Ogasawara et aL, /. Bacteriol, 178:6288-95 

15 (1996)). Because P. gingivalis dipeptidylpeptidase does not exhibit any 

significant homology to any of the dipeptidylpeptidases described so far, this 
enzyme has been designated DPP-7. 

Interestingly the P. gingivalis DPP-7 displays the consensus sequence 
characteristic for the catalytic site of the V-8 like proteases, a group of 

20 endopeptidases cleaving after glutamic or aspartic acid residues (Carmona et aL, 
Nucleic Acids Res., 15:6757 (1987)). This region of homology is specifically 
located only at the C-terminal region of both proteases and includes the putative 
active site serine residue. Interestingly, more coding sequences encoding 
putative, DPP-7 related proteases in P. gingivalis, Xylella fastidiosa and 

25 Shewanella putrefaciens were identified. Based on the enzymological and 

coding sequence data presented above, the P. gingivalis DPP-7 does not belong 
to any of the peptidase families previously reported and should, therefore, be 
regarded as a prototype enzyme that defines a new family of 
dipeptidylpeptidases. 

30 The invention further includes a polypeptide, preferably a 

dipeptidylpeptidase, that shares a significant level of primary structure (referred 
to as "percent identity") with SEQ ID NO:2. The level of identity is determined 

11 
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by aligning the two amino acid sequences (i.e., the amino acid sequence of the 
polypeptide and the sequence SEQ ID NO:2) such that the residues that make up 
the putative active site sequence (e.g., about amino acid 644 to about 653, 
preferably about amino acid 644 to about 658) are in register, then further 
5 aligned to maximize the number of amino acids that they have in common along 
the lengths of their sequences; gaps in either or both sequences are permitted in 
making the alignment in order to place the residues that make up the putative 
active site sequence (e.g., about amino acid 644 to about 653, preferably about 
amino acid 644 to about 658) in register and to maximize the number of shared 

10 amino acids, although the amino acids in each sequence must nonetheless remain 
in their proper order. Preferably, two amino acid sequences are compared using 
the blastp program of the BLAST search algorithm, which is described by 
Altshul et al., (NucL Acids Res., 25, 3389-3402 (1997)), and available at the 
National Center for Biotechnology Information (e.g., 

15 www.ncbi.nlm.nih.gov/Microb_blast/unfinishedgenome.html or 

www.ncbi.nlm.nih.gov/BLAST/). Preferably, the default values for all BLAST 
search parameters are used. In the comparison of two amino acid sequences 
using the BLAST search algorithm, structural similarity is referred to as 
"identities." Preferably, a dipeptidylpeptidase has, in increasing order of 

20 preference, at least about 40% identity, at least about 50% identity, at least about 
60% identity, at least about 70% identity, at least about 80% identity, and most 
preferably at least about 90% identity with SEQ ID NO:2. Preferably, about 
amino acid 543 to about 699 of SEQ ID NO:2 are used, more preferably about 
amino acid 71 to about 712 of SEQ ID NO:2 are used. Preferably the invention 

25 includes an isolated polypeptide including an amino acid sequence having a 
percentage amino acid identity of greater than about 40% with SEQ ID NO:2. 

In general, the amidolytic activity of the polypeptides of the invention, 
preferably dipeptidylpeptidases, can be measured by assay of the cleavage of a 
target polypeptide in the presence of dipeptidylpeptidase and a buffer. 

30 Preferably, the ratio of dipeptidylpeptidase to target polypeptide is at least about 
1:1; more preferably at least about 1:100; even more preferably at least about 
1 : 1 ,000; and most preferably at least about 1 : 1 0,000. Preferably, the ratio of 
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dipeptidylpeptidase to target polypeptide is at most about 1:10,000,000; more 
preferably at most about 1: 1,000,000; and most preferably at most about 
1:100,000. Buffers in which a dipeptidylpeptidase is active are suitable for the 
assay. Preferably, the buffer is at most about 200 mM HEPES (4-(2- 
5 hydroxyethyl)-l-piperazineethanesulfonic acid), more preferably at most about 
50 mM HEPES, and most preferably at most about 20 mM HEPES. Preferably, 
the pH of the buffer is at least about pH 6.0. Preferably, the pH of the buffer is 
at most about pH 8.0 and more preferably at most about pH 7.5. Preferably, the 
temperature of the assay is about 37°C. The assay can be carried out for at least 

10 about 1 minute to at most about 24 hours. Preferably, the amidolytic activity of 
the dipeptidylpeptidases are measured at a dipeptidylpeptidase:target polypeptide 
ratio of at least about 1: 100 and at most about 1: 1,000,000 in, about 200 mM 
HEPES, about pH 7.5 at about 37°C for at least about 3 hours. In general, the 
time of the assay can vary depending on the substrate and enzyme:substrate 

15 ratio. Typically, target polypeptides are stable under these conditions, and 

typically it is difficult to detect background levels of hydrolysis in the absence of 
a dipeptidylpeptidase. Preferably, the assay is allowed to continue until at least 
about 1% of the target polypeptide is hydrolyzed. 

Dipeptidylpeptidases of the present invention are preferably inhibited by 

20 compounds including, for example, 4-(2-aminoethyl)-benzenesulfonyl fluoride 
hydrochloride (PEFABLOCK); diisopropylfluorophosphate (DFP); 
phenylmethanesulfonyl fluoride (PMSF); 3,4-dichloisocoumarin; and 
combinations thereof. The peptidases of the present invention are preferably not 
inhibited by a compounds including, for example, specific inhibitors of metallo 

25 peptidases, cysteine peptidases, and aspartic peptidases. 

An active analog, active fragment, or active modification of a 
polypeptide including the amino acid sequence SEQ ID NO:2 is one that has 
amidolytic activity by hydrolysis of the target polypeptide described above. 
Active analogs of a polypeptide including the amino acid sequence SEQ ID 

30 NO:2 include dipeptidylpeptidases having amino acid substitutions that do not 
eliminate hydrolysis of the target polypeptide at the peptide bond between the 
second and third amino acids. Substitutes for an amino acid may be selected 
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from other members of the class to which the amino acid belongs. For example, 
nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, 
proline, phenylalanine, and tryptophan. Polar neutral amino acids include 
glycine, serine, threonine, cysteine, tyrosine, asparagine and glutamine. The 
5 positively charged (basic) amino acids include arginine, lysine and histidine. 
The negatively charged (acidic) amino acids include aspartic acid and glutamic 
acid. Examples of preferred conservative substitutions include Lys for Arg and 
vice versa to maintain a positive charge; Glu for Asp and vice versa to maintain 
a negative charge; Ser for Thr so that a free -OH is maintained; and Gin for Asn 

10 to maintain a free NH2. 

Active fragments of a dipeptidylpeptidase of the invention include 
dipeptidylpeptidases containing deletions or additions of one or more contiguous 
or noncontiguous amino acids such that the resulting polypeptide will hydrolyze 
the target polypeptide at the bond between the second and third amino acids. 

15 Modified dipeptidylpeptidases include dipeptidylpeptidases that are chemically 
and enzymatically derivatized at one or more constituent amino acid, including 
side chain modifications, backbone modifications, and N- and C- terminal 
modifications including acetylation, hydroxylation, methylation, amidation, and 
the attachment of carbohydrate or lipid moieties, cofactors, and the like. 

20 Modified dipeptidylpeptidases will hydrolyze the target polypeptide at the 
peptide bond between the second and third amino acids. 

Preferably, a dipeptidylpeptidase includes the sequence TGGNSGSPV 
(SEQ ID NO:26), and more preferably includes the consensus sequence for the 
active-site serine residue of serine type proteases, TGGNSGSPVF (SEQ ID 

25 NO:25), where T is Threonine, G is glycine, N is Asparagine, P is Proline, V is 
valine, F is Phenylalanine, and S is serine, with the putative active site serine 
being underlined. The active site serine can be identified by, for instance, 
labeling with diisopropylfluorophosphate as described herein. Preferably, the 
catalytic domain of the dipeptidylpeptidases of the invention begins at about 

30 residue 543 of SEQ ID NO:2 and includes the remaining 169 amino acids, more 
preferably begins at about residue 540 of SEQ ID NO:2 and includes the 
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remaining 172 amino acids, and most preferably begins at about residue 522 of 
SEQ ID NO:2 and includes the remaining carboxy-terminal amino acids. 

Dipeptidylpeptidases can be obtained by several methods. Isolation of a 
dipeptidylpeptidase present on the surface of a cell producing the peptidase 

5 typically requires lysis of the cell followed by purification methods that are well 
known in the art. Alternatively, cells can be treated with a detergent, for 
instance Triton X-100, to remove the peptidase from the cell surface. The 
following are nonlimiting examples of suitable protein purification procedures: 
fractionation on imrnunoaffinity, ion-exchange, hydroxyapatite, Phenyl- 

10 Sepharose HP, MonoQ HR 5/5, or MonoP columns; ethanol precipitation; 

reverse phase HPLC; chromatography on silica or on an ion-exchange resin such 
as DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel 
filtration using, for example, Sephadex G-75. Preferably, isolation of a 
dipeptidylpeptidase from P. gingivalis is accomplished using a combination of 

15 hydroxyapatite, Phenyl-Sepharose HP, MonoS HR 5/5 and MonoP column 
chromatography steps as described herein. 

Dipeptidylpeptidases can also be isolated from organisms other than P. 
gingivalis. Other organisms can express a dipeptidylpeptidase that is encoded by 
a coding region having similarity to the coding region encoding SEQ ID NO:2. 

20 A "coding region," a "coding sequence," or an "open reading frame" (ORF) is a 
linear form of nucleotides that encodes a polypeptide, usually via mRNA, when 
placed under the control of appropriate regulatory sequences. The boundaries of 
a coding region are generally determined by a translation start codon at its 5' end 
and a translation stop codon at its 3 5 end. "Regulatory region" refers to a nucleic 

25 acid that regulates expression of a coding region to which a regulatory region is 
operably linked. Non limiting examples of regulatory regions include 
promoters, transcription initiation sites, translation start sites, translation stop 
sites, and terminators. "Operably linked" refers to a juxtaposition wherein the 
components so described are in a relationship permitting them to function in 

30 their intended manner. A regulatory element is "operably linked" to a coding 
region when it is joined in such a way that expression of the coding region is 
achieved under conditions compatible with the regulatory region. Alternatively, 
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other organisms can express a dipeptidylpeptidase from a recombinant coding 
region encoding the peptidase. The identification of similar coding regions in 
other organisms can be accomplished as described herein. A dipeptidylpeptidase 
can be isolated using purification methods that are well known in the art. 
5 Alternatively, the peptidase can be chemically synthesized using methods that 
are well known in the art including, for instance, solid phase synthesis. 
Examples of, for instance, coding and regulatory regions are described herein. 

The expression of a dipeptidylpeptidase by an organism other than P. 
gingivalis can be detected using specific substrates of the general formula 

10 Xaa-Xaa-LG, wherein Xaa represents any natural amino acid and LG is a 

leaving group. The leaving group can be a chromogenic or fluorogenic group 
known to the art. The expression of a dipeptidylpeptidase by an organism and 
subsequent cleavage of a specific substrate results in a free amino acid or a free 
leaving group, each of which can be assayed using techniques known to those of 

15 skill in the art. Other methods can be based on immunogenic properties of DPP- 
7, for instance immunoassays and histochemistry, the detection of mRNA, and 
PCR related methods, all of which are known to one of skill in the art. 

In one aspect, the present invention is directed to a nucleic acid encoding 
a polypeptide, particularly a dipeptidylpeptidase, active analog, active fragment, 

20 or active modification thereof. The nucleic acid can have a nucleotide sequence 
as shown in SEQ ID NO;l. Alternatively, nucleic acids of the invention include 
those whose complement hybridize to SEQ ID NO: 1 under standard 
hybridization conditions as described herein. During hybridization the entire 
nucleotide sequence of the complement can hybridize with SEQ ID NO: 1. 

25 Preferably, at least about 20 nucleotides of the complement hybridize with SEQ 
ID NO:l, more preferably at least about 50 nucleotides, most preferably at least 
about 100 nucleotides. 

The identification of similar coding regions in other organisms can be 
accomplished by screening individual wild-type microorganisms for the 

30 presence of nucleotide sequences that are similar to the coding region of DPP-7, 
which is shown in SEQ ID NO:l. Screening methods include, for instance, 
hybridization of a detectably labeled probe with a nucleic acid. 
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Standard hybridizing conditions are a modification of the conditions used 
by Church et al. ((1984) Proa Natl Acad ScL USA 81, 1991): 0.5 M phosphate 
buffer, pH 7.2, 7% SDS, 10 mM EDTA, at 68°C, and three washes, each for 20 
minutes in 2x SSC (preferably 0.1 SSC), 0.1% SDS, at 65°C. Preferably, a 
5 probe will hybridize to the nucleotide sequence set forth in SEQ ID NO: 1 under 
standard hybridizing conditions. Generally the probe does not have to be 
complementary to all the nucleotides of the nucleic acid as long as there is 
hybridization under the above-stated conditions. 

"Complement" and "complementary" refer to the ability of two single 

10 stranded nucleic acids to base pair with each other, where an adenine on one 
nucleic acid will base pair to a thymine on a second nucleic acid and a cytosine 
on one nucleic acid will base pair to a guanine on a second nucleic acid. Two 
nucleic acids are complementary to each other when a nucleotide sequence in 
one nucleic acid can base pair with a nucleotide sequence in a second nucleic 

15 acid. For instance, 5 -ATGC and 5 -GC AT are complementary. The term 

complement and complementary also encompasses two nucleic acids where one 
nucleic acid contains at least one nucleotide that will not base pair to at least one 
nucleotide present on a second nucleic acid. For instance the third nucleotide of 
each of the two nucleic acids 5-ATTGC and 5-GCTAT will not base pair, but 

20 these two nucleic acids are complementary as defined herein. Typically two 

nucleic acids are complementary if they hybridize under the standard conditions 
referred to herein. 

Preferred probes are nucleic acids complementary to a coding region or 
another nucleotide sequence that encodes a dipeptidylpeptidase. For instance, a 

25 probe can include a consecutive series of nucleotides complementary to a portion 
of SEQ ID NO: 1. Preferably a probe is at least about 18 bases, more preferably 
at least about 21 bases, and most preferably at least about 24 bases in length. 
One of skill in the art could select useful probes as desired. Methods of 
detectably labeling a probe are well known to the art. 
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The nucleic acid that is identified by the probe is farther analyzed to 
determine if it encodes a polypeptide with amidolytic activity of the peptide 
bond between the second and third amino acids from the N-terminus on a target 
polypeptide of the general formula H-Xaa-Yaa-Xaa, wherein Xaa is a natural or 
5 modified amino acid and Yaa is an amino acid including an aliphatic or an 
aromatic residue as a substituent on the a-carbon atom. Another method for 
screening individual microorganisms for the presence of nucleotide sequences 
that are similar to the coding regions of the present invention is the polymerase 
chain reaction (PCR). 

10 Individual wild-type microorganisms containing nucleic acids encoding a 

dipeptidylpeptidase can also be identified using antibody. Preferably the 
antibody is directed to DPP-7. The production of antibodies to a particular 
polypeptide is known to a person of skill in the art, and is further detailed herein. 
The use of hybridization of a probe to a coding region present in 

15 individual wild-type microorganisms can be used as a method to identify a 
coding region identical or similar to a coding region present in SEQ ID NO:l. 
The coding region can then be isolated and ligated into a vector as described 
below. 

The present invention is also directed to coding regions sharing a 
20 significant level of primary structure with the coding region present at SEQ ID 
NO: 1. The level of identity is determined by aligning the two nucleotide 
sequences (i.e., the nucleotide sequence of the polynucleotide and the sequence 
SEQ ID NO:l) such that the residues that encode the putative active site of the 
encoded protein (e.g., about nucleotide 1929 to about 1974) are in register, then 
25 further aligned to maximize the number of nucleotides that they have in common 
along the lengths of their sequences; gaps in either or both sequences are 
permitted in making the alignment in order to place the residues that encode the 
putative active site of the encoded protein (e.g., about nucleotide 1929 to about 
1974) in register and to maximize the number of shared nucleotides, although the 
30 nucleotides in each sequence must nonetheless remain in their proper order. 

Preferably, two nucleotide sequences are compared using the blastn program of 
the BLAST search algorithm, which is described by Altshul et al., (NucL Acids 
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Res., 25, 3389-3402 (1997)), and available at the National Center for 
Biotechnology Information (e.g., 

www.ncbi.nlm.nih.gov/Microb__blast/unfinishedgenome.html or 
www.ncbi.nlm.nih.gov/BLAST/). Preferably, the default values for all BLAST 
5 search parameters are used. In the comparison of two nucleotide sequences 
using the BLAST search algorithm, structural similarity is referred to as 
"identities." Preferably, two nucleotide acid sequences have, in increasing order 
of preference, preferably at least about 70%, at least about 80%, at least about 
90%, at least about 95%, and most preferably at least about 95% identity. 

10 As mentioned above, a nucleic acid of the invention can be inserted in a 

vector. Construction of vectors containing a nucleic acid of the invention 
employs standard ligation techniques known in the art. See, e.g., Sambrook et 
al, Molecular Cloning: A Laboratory Manual., Cold Spring Harbor Laboratory 
Press (1989) or Ausubel, R.M., ed. Current Protocols in Molecular Biology 

15 (1994). A vector can provide for further cloning (amplification of the nucleic 
acid), i.e., a cloning vector, or for expression of the polypeptide encoded by the 
coding region, i.e., an expression vector. The term vector includes, but is not 
limited to, plasmid vectors, viral vectors, cosmid vectors, or artificial 
chromosome vectors. Typically, a vector is capable of replication in a bacterial 

20 host, for instance E. colt Preferably the vector is a plasmid. 

Selection of a vector depends upon a variety of desired characteristics in 
the resulting construct, such as a selection marker, vector replication rate, and 
the like. Suitable plasmids for expression in E. coli, for example, include 
pUC(X), pKK223-3, pKK233-2, P Trc99A, and pET-(X) wherein (X) denotes a 

25 vector family in which numerous constructs are available. pUC(X) vectors can 
be obtained from Pharmacia Biotech (Piscataway, NH) or Sigma Chemical Co. 
(St. Louis, MO). pKK223-3, pKK233-2 and pTrc99A can be obtained from 
Pharmacia Biotech. pET-(X) vectors can be obtained from Promega (Madison, 
WI) Stratagene (La Jolla, CA) and Novagen (Madison, WI). To facilitate 

30 replication inside a host cell, the vector preferably includes an origin of 
replication (known as an "ori") or replicon. For example, ColEl and P15A 
replicons are commonly used in plasmids that are to be propagated in E. colL 
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An expression vector optionally includes regulatory regions operably 
linked to the coding region. The invention is not limited by the use of any 
particular promoter, and a wide variety are known. Promoters act as regulatory 
signals that bind RNA polymerase in a cell to initiate transcription of a 
5 downstream (3' direction) coding region. The promoter used in the invention 
can be a constitutive or an inducible promoter. It can be, but need not be, 
heterologous with respect to the host cell. Preferred promoters for bacterial 
transformation include lac, tacUV5, tac, trc, T7, SP6 and ara. 

An expression vector can optionally include a Shine Dalgarno site (e.g., a 

10 ribosome binding site), and a start site (e.g., the codon ATG) to initiate 

translation of the transcribed message to produce the enzyme. It can also include 
a termination sequence to end translation. A termination sequence is typically a 
codon for which there exists no corresponding aminoacetyl-tRNA, thus ending 
polypeptide synthesis. The nucleic acid used to transform the host cell can 

15 optionally further include a transcription termination sequence. The rrnB 

terminators, which is a stretch of DNA that contains two terminators, Tl and T2, 
is an often used terminator that is incorporated into bacterial expression systems 
(J. Brosius et al., (1981) 7. Mol Biol 148 107-127). 

The nucleic acid used to transform the host cell optionally includes one 

20 or more marker sequences, which typically encode a polypeptide that inactivates 
or otherwise detects or is detected by a compound in the growth medium. For 
example, the inclusion of a marker sequence can render the transformed cell 
resistant to an antibiotic, or it can confer compound-specific metabolism on the 
transformed cell. Examples of a marker sequence are sequences that confer 

25 resistance to kanamycin, ampicillin, chloramphenicol, and tetracycline. 

Antibodies to a polypeptide including the sequence SEQ ID NO: 2 can be 
produced. Alternatively, antibodies to an antigenic analog, antigenic fragment, 
or antigenic modification of a polypeptide including the sequence SEQ ID NO:2 
can be made. An antigenic analog, antigenic fragment, or antigenic modification 

30 of a polypeptide including the amino acid sequence SEQ ID NO:2 is one that 
generates an immune response in an animal. Preferably, an antigenic analog, 
antigenic fragment, or antigenic modification has amidolytic activity. Antigenic 
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analogs of a polypeptide including the amino acid sequence SEQ ID NO:2 
include dipeptidylpeptidases having amino, acid substitutions that do not 
eliminate peptide antigenicity in an animal. Substitutes for an amino acid may 
be selected from other members of the class to which the amino acid belongs, as 
5 described herein. Fragments of a dipeptidylpeptidase of the invention include 
dipeptidylpeptidases containing deletions or additions of one or more contiguous 
or noncontiguous amino acids such that the resulting polypeptide will generate 
an immune response in an animal. Modified dipeptidylpeptidases include 
dipeptidylpeptidases that are chemically and enzymatically derivatized at one or 

10 more constituent amino acids, including side chain modifications, backbone 
modifications, and N- and C- terminal modifications including acetylation, 
hydroxylation, methylation, amidation, and the attachment of carbohydrate or 
lipid moieties, cofactors, and the like. 

Accordingly, an aspect of the invention is an immunogenic composition 

15 including an isolated dipeptidylpeptidase, an antigenic analog, antigenic 

fragment, or antigenic modification thereof. The dipeptidylpeptidase preferably 
has amidolytic activity for cleavage of the target polypeptide described herein. 

The immunogenic composition can further include excipients or diluents 
that are pharmaceutically acceptable as carriers and compatible with the 

20 immunogenic composition. The term "pharmaceutically acceptable carrier" 
refers to a carrier(s) that is "acceptable" in the sense of being compatible with 
the other ingredients of a composition and not deleterious to the recipient 
thereof. Suitable excipients include, for example, water, saline, dextrose, 
glycerol, ethanol, and combinations thereof. In addition, if desired, the 

25 immunogenic composition may contain minor amounts of auxiliary substances 
such as wetting or emulsifying agents, pH buffering agents, and/or adjuvants 
which enhance the effectiveness of the immune-stimulating composition. 

The immunogenic composition can be used in a method for protecting an 
animal from a disease caused by P. gingivalis. This method includes 

30 administering the immunogenic composition and eliciting antibodies to a 
dipeptidylpeptidase, antigenic analog, antigenic fragment, or antigenic 
modification. The diseases that can be treated in this manner include 
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periodontal diseases, which include gingivitis and periodontitis. Clinical 
hallmarks of periodontitis include loss of tooth attachment and periodontal 
pocket formation. 

Alternatively and preferably, periodontal diseases can be treated by the 
5 use of inhibitors of a dipeptidylpeptidase. An inhibitor of a dipeptidylpeptidase 
can be present in a composition that preferably contains a pharmaceutically 
acceptable carrier. For instance, inhibitors can be applied systemically, 
subgingivally (e.g., subgingival irrigation), and/or by controlled release delivery 
directly into the periodontal pocket using methods well known in the art (see, 

10 e.g., Kornman, X Periodontol 64:782-91 (1993). Preferably, an inhibitor is 
applied subgingivally or by controlled release delivery. 

The dipeptidylpeptidases, active analogs, active fragments, and active 
modifications thereof can be used in a method of reducing growth of bacteria in 
vitro or in vivo. Preferably, the bacteria is a periodontal pathogen, i.e, a bacterial 

15 pathogen that causes periodontal disease, more preferably the bacteria is P. 
gingivalis. The inability of asaccharolytic P. gingivalis to utilize free amino 
acids makes the bacterium entirely dependant on an external peptide supply. 
The action of the polypeptides of the invention may be required for bacterial 
growth, and inhibition of the polypeptides of the invention may inhibit the in 

20 vivo growth of organisms, including P. gingivalis. The method includes 

decreasing the amount of dipeptides (e.g., the result of cleavage of the target 
polypepitde by a dipeptidylpeptidase) and the amount of free amino acids that 
result from further cleavage of the dipeptides present by inhibiting a 
dipeptidylpeptidase, active analog, active fragment, or active modification 

25 thereof, such that the amount of dipeptides generated by the polypeptides is 
decreased. The amount of dipeptides is decreased relative to the amount of 
dipeptides present in the absence of the inhibitor. Preferably, the amount of 
dipeptides generated is decreased by an inhibitor, a monoclonal antibody that 
inhibits the dipeptidylpeptidase, or polyclonal antibodies that inhibit the 

30 dipeptidylpeptidase, more preferably, the amount of dipeptides generated is 

decreased by an inhibitor. Preferably, an inhibitor acts to inhibit a polypeptide of 
the invention, preferably a dipeptidylpeptidase, by blocking the active site of the 

22 



WO 02/38742 



PCT/US01/46782 



polypeptide. The polypeptide can be present on the surface of the bacteria or 
secreted into the environment, preferably the polypeptide is present in the 
surface of the bacteria. 

The present invention is also directed to a method of developing an 
5 inhibitor of a dipeptidylpeptidase, active analog, active fragment, or active 
modification thereof, preferably a dipeptidylpeptidase. The method includes 
identifying a compound that inhibits the amidolytic activity of the 
dipeptidylpeptidase. Such compounds include, for example, polypeptides, 
organic compounds, inorganic compounds, metals, non-ribosomal polypeptides, 

10 polyketides, and peptidomimetics. The identification of compounds can be 
accomplished by, for instance, incubating the dipeptidylpeptidase with a 
candidate compound under conditions that promote amidolytic activity of the 
dipeptidylpeptidase and determining if the amidolytic activity of the 
dipeptidylpeptidase is decreased relative to the amidolytic activity in the absence 

15 of the compound. The amidolytic activity can be measured by cleavage of the 
peptide bond between the second and third amino acids of the target polypeptide 
as described herein. One method of developing an inhibitor includes using the 
target polypeptide and replacing the Xaa residues with modified amino acids. It 
is expected that some modified amino acids will cause the target polypeptide to 

20 act as an inhibitor. 

The present invention is illustrated by the following examples. It is to be 
understood that the particular examples, materials, amounts, and procedures are 
to be interpreted broadly in accordance with the scope and spirit of the invention 
25 as set forth herein. 

EXAMPLES 

Materials 

Diisopropylfluorophosphate (DFP), leupeptin and 3,4- 
30 dichloroisocoumarin, were purchased from Calbiochem (La Jolla, CA). Other 
peptides used in this study were synthesized at the Molecular Genetic 
Instrumental Facility (University of Georgia, Athens, GA) using Fmoc protocol 
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with an advanced ChemTech MPS350 automated synthesizer. The peptides H- 
Xaa-Xaa-pNA and Z-Xaa-Xaa-pNA, where pNA is p- Nitroanilide; Z is 
benzyloxycarbonyl; and H is hydrogen and denotes an unblocked amino- 
terminal group were obtained from Bachem (King of Prussia, PA). 

5 

Methods 

EXAMPLE 1 

Source and Cultivation of Bacteria - P. gingivalis DPP-7 was purified 
from strain HG66, a kind gift of Dr. Roland Arnold (University of North 
10 Carolina, Chapel Hill, NC). The cells were grown as described previously (Chen 
et al., /. Biol Chem., 267:18896-901 (1992)). 

Protein Determination - Protein concentration was determined with the 
BCA reagent kit (Sigma), using bovine serum albumin as a standard. 

Localization of Dipeptidylpeptidase Activity - The localization of active 
15 enzyme was checked in bacterial cells that had been subjected to a previously 
described fractionation procedure (Banbula et al., Infect Immun., 68:1176-82 
(2000)). All fractions, as well as the full culture, culture medium, and fall 
culture after sonication, were assayed for amidolytic activity against H-AIa-Phe- 
pNA. 

20 Enzyme Purification - All purification steps were performed at 4°C 

except for FPLC separations, which were carried out at room temperature. The 
cells were collected by centrifugation (6,000 x g, 30 minutes) and resuspended in 
50 mM potassium phosphate buffer, pH 7.4. The outer membrane proteins were 
solubilized with 0.05% Triton X-100. After 2 hours of gentle stirring, unbroken 

25 cells were removed by centrifugation (28,000-x g, 60 minutes). Proteins from 
the supernatant were precipitated with cold acetone (60% final concentration), 
collected by centrifugation, and redissolved in 50 mM potassium phosphate 
buffer, pH 7.0. After extensive dialysis against the same buffer the sample was 
loaded onto a hydroxyapatite column (BioRad) previously equilibrated with 20 

30 mM potassium phosphate, pH 7.0, at a flow rate of 20 ml/hour. The column was 
then washed until the A 2 go fell to zero. Bound proteins were eluted with a 
potassium phosphate gradient (20-300 mM) and fractions (7 ml) were analyzed 
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for amidolytic activity against H-Ala-Phe-pNA. The active fractions were 
saturated with 1 M ammonium sulfate and loaded onto a Phenyl-Sepharose HP 
column (Pharmacia) equilibrated with 50 mM potassium phosphate, pH 7.0, 
containing 1M ammonium sulfate. The column was washed with two volumes 
5 of the equilibration buffer, followed by a wash with buffer containing 0.4 M 
ammonium sulfate, and developed with a descending gradient of ammonium 
sulfate from 0.4 to 0 M. Active fractions were pooled, extensively dialyzed 
against 20 mM MES, pH 6.6, and applied onto a MonoS HR 5/5 FPLC 
(Pharmacia) column equilibrated with the same buffer. Bound proteins were 

10 eluted with a 0-300 mM NaCl gradient This allowed us to obtain a homogenous 
preparation of active proteinase. 

Electrophoretic Techniques - The SDS-PAGE system of Schagger and 
von Jagow (Schagger et al., Anal Biochem., 166:368-79 (1987)), was used to 
monitor enzyme purification and estimate the enzyme molecular mass. For 

15 amino-terminal sequence analysis, proteins resolved in SDS-PAGE were 

electroblotted onto polyvinylidene difluoride membranes using 10 mM CAPS, 
pH 11, 10% methanol (Matsudaira et al., 7. Biol Chem., 262:10035-38 (1987)). 
After staining with Coomasie Blue G250 the blot was air dried, and protein 
bands cut out and subjected to amino-terminal sequence analysis with an 

20 Applied Biosystems 49 1 Protein Sequencer using the program designed by the 
manufacturers. 

Kinetic Analysis - Routinely, the dipeptidylpeptidase amidolytic activity 
was measured with H-Ala-Phe-pNA (ImM) in 6.2 M HEPES, pH 7.8 at 37°C. 
The reaction was followed for specific time intervals in a thermostated ELIS A 

25 reader (SpectraMax, Applied Biosystem) and the release of p-nitroaniline was 
monitored at 405 nm. Other p-nitroanilide substrates were used in the same 
manner. For inhibition studies, the enzyme was first preincubated with an 
inhibitor for 15 minutes at 37°C, substrate added, and residual activity recorded. 
The initial steady-state velocity (v 0 ) was determined by continuous assay for the 

30 range of substrate concentrations (100 nM to 1 mM). K m and V max were 
determined by hyperbolic regression of the kinetic data using the software 
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package Hyper Version 1.02 obtained from Dr. J. S. Easterby (University of 
Liverpool, UK). 

Enzyme Fragmentation - The purified dipeptidylpeptidase was subjected 
to in-gel tryptic digestion (Rosenfeld et al., Anal Biochem., 203:173-79 (1992)). 
5 Peptides were extracted and separated by microbore reverse-phase HPLC 

Fractions absorbing at 210 nm were manually collected, and their masses were 
determined by reflectron MALDI-TOF mass spectrometry using a Bruker 
Daltonics ProFlex instrument as described previously (Pohl et al., Lett, Peptide 
Sci., 1:291-97 (1995)). Selected peptides were subjected to Edman degradation 

10 in a model Procise-cLS sequencer (PE Biosystems, CA). 

Identification of the DPP -7 Coding Sequence - An unfinished P. 
gingivalis W83 genome database, available from the Institute for Genomic 
Research, was searched for the presence of nucleotide sequences corresponding 
to the amino-terminal and the internal DPP-7 amino acid sequences using the 

15 TBLASTN algorithm (Altschul et al., Nucleic Acids Res., 25:3389-402 (1997)). 
An identified contig gnl | TIGR I P. gingivalis_1208 was retrieved from the 
Institute for Genomic Research database. The position of the DPP-7 coding 
sequence was localized using the National Center for Biotechnology Information 
(NCBI) open reading frame (ORF) finder and the amino acid sequence, obtained 

20 by conceptual translation of the entire ORF, was further used for homology 
screening by use of the NCBI BLAST search tool. 

Enzyme Specificity - The determination of substrate specificity was based 
on the separation of the products of peptide hydrolysis by reverse-phase 
chromatography. Peptides were first incubated with 1 microgram of DPP-7 at an 

25 enzyme: substrate molar ratio of 1 : 100 for 3 hours or 24 hours in 50 microliters 
of 200 mM HEPES, 100 mM NaCl pH 8.0, at 37°C, and the reaction stopped by 
acidification with trifluoroacetic acid. The samples were then subjected to 
reverse-phase high pressure liquid chromatography using a Supelcosil LC 18 
column (Supelco) with an acetonitrile gradient 0-60% in 0.075% trifluoroacetic 

30 acid in 50 minutes. Each peak, detected at 210 nm, was collected, lyophilized, 
re-dissolved in 50% (v/v) methanol, 0.1% acetic acid and subjected to analysis 
by mass spectrometry. 
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EXAMPLE 2 

A 76 kDa dipeptidylpeptidase associated with P. gingivalis membranes 
was solubilized by mild detergent treatment. This procedure released more than 

5 90% of the amidolytic activity against H-Ala-Phe-pNA into the medium. After 
acetone precipitation and subsequent chromatography steps including the use of 
hydroxyapatite, Phenyl-Sepharose and MonoS columns (Fig. 1) a pure enzyme 
preparation was obtained. The homogeneity of the preparation and molecular 
mass of the protein were checked both by SDS PAGE (Fig. 2) and gel filtration 

10 on a TSK G3000 SW column. 

EXAMPLE 3 

Inhibition Profile - Based on the inhibition studies (Table I), DPP-7 was 
classified as a serine protease. DPP-7 was inactivated by 

1 5 diisopropylfluorophosphate, PEFABLOCK and 3,4-dichlois;oeoumarin, but not 
by typical cysteine class inhibitors such as E-64 or iododoacetic acid. Metal 
chelators including EDTA and 1 ,10-orthophenanthroline, as well as reducing 
agents did not influence its activity. The enzyme was not sensitive to 
inactivation by either detergents (0.5% SDS, 1 % Triton X-100) or heavy metal 

20 ions including Zn 2+ , Co 2 * and Ni 2+ . Human plasma inhibitors, such as (Xi- 

proteinase inhibitor, aj-antichymotrypsin, and a 2 -macroglobulin, did not effect 
enzyme activity nor were they cleaved by DPP-7. 

TABLE 1: Effect of different compounds on P. gingivalis DPP-7 activity. 



Inhibitor Concentration % of residual activity 



Diisopropylfluorophosphate 


10 mM 


34 


PEFABLOCK 


4 mg/ml 


1 


3,4-dichloroisocoumarin 


2mM 


0 


E-64. 


1 micromolar 


96 


Iodoacetic acid 


0.1 mM 


102 


EDTA 


10 mM 


90 


1 ,10-orthophenanthroline 


1 mM 


98 
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Leupeptin 


0.1 mM 


107 


Aprotinin 


0.5 mg/ml 


128 


Pepstatin 


0.5 mg/ml 


127 


Cysteine 


10 mM 


90 


Gly-Ala 


lOOmM 


102 


Arg-Phe 


lOOmM 


69 


Ala-Gly 


100 mM 


96 


Arg-Gly 


10 mM 


84 


Lys-Gly 


10 mM 


96 


Ni~ 


1 mM 


95 


ZrT 


1 mM 


95 


Co^ 


1 mM 


116 


SDS 


0.5% 


65 


SDS 


1% 


0 


Triton X-100 


0.1% 


144 


Triton X-100 


0.5% 


103 


Triton X-100 


1% 


94 



EXAMPLE 4 

pH Optimum and Stability - Purified DPP-7 was active against H-Ala- 
5 Phe-pNA over a broad pH range, from neutral to basic pH (6.5-9.0) (Fig. 3). 

This activity also changed with the ionic strength of the buffer, reaching 200% at 
0.5 M NaCl concentration in 100 mM HEPES, pH 8.0. DPP-7 was stable in 0.2 
M HEPES, pH 8.0, for one week at 4°C. The protease showed no appreciable 
loss of activity when kept frozen at -80°C for one month. After 3 hours 
10 incubation at either room temperature or 37°C, activity was reduced to 62% and 
20%, respectively. The optimum temperature for the hydrolysis of H-Ala-Phe- 
pNA was determined to be 43°C. 
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EXAMPLES 

Substrate Specificity ~ Among several chromogenic substrates tested, 
only those with an aliphatic or an aromatic side chain residues in the second, 
penultimate position were rapidly hydrolyzed by DPP-7 (Table 2). 

5 

TABLE 2: Kinetic analysis for paranitroanalides cleavage by DPP-7. 
Substrate Km [mM] Vmax 



H-Ala-Ala-pNA 0.313 129.65 

H-Ala-Phe-pNA 0.441 170.06 

H-GIy-Phe-pNA 0.256 54.54 



Several other substrates including H-Ala-Pro-pNA, H-Ala-pNA, H-Gly- 
pNA, H-He-pNA, H-Leu-pNA, H-Lys-pNA, H-Phe-pNA, H-Gly-Arg-pNA, H- 

10 Gly-Glu-pNA, H-Gly-Lys-pNA, H-Ala-Gly -pNA, H-Gly-Gly-pNA, H-Ala- 

Al a-Phe-pNA,H- AJ a-Gl y- Arg-pN A, H~Leu-Thr-Arg-pNA, H-Ala-Phe-Pro-pNA, 
Na-benzoyl-DL-arginine-pNA, N-me t- Al a-Pro- V al-pN A , N-suc-Ala-Ala-pNA, 
N-suc-Ala-AJa-Pro-Glu-pNA, N-suc-Ala-Ala-Pro-Leu-pNA, N-suc-Ala-Ala- 
Val-AIa-pNA, Z-Ala-Ala-pNA, Z-Lys-pNA, Z-Arg-pNA, Z-Glu-Glu-pNA, Zr 

15 Leu-Leu-Glu-pNA, Z-Lys-Arg-pNA, Z-Phe-Arg-pNA, Z-Phe-Val-Arg-pNA, Z- 
Tyr-Lys-Arg-pNA were tested, but none of these was hydrolysed by DPP-7. 

To further confirm specificity, several synthetic peptides were also tested 
as substrates for this enzyme. Again, only those polypeptides having an amino 
acid with an aliphatic or an aromatic residue as a substituent on the a-carbon 

20 atom of the second amino acid from the N-terminal end of the polypeptide were 
cleaved (Table 3), with glycine, proline, or charged amino acids not being 
acceptable as the second amino acid from the N-terminal end of the polypeptide. 
The protease did not show any endopeptidase activity on gelatin, insulin (5 chain, 
carboxymethylated lysosyme, azocazein or type I collagen. Purified DPP-7 was 
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devoid of any aminopeptidase activity and did not cleave model substrates with 
blocked amino-terminL 



TABLE 3: Specificity of P. gingivalis DPP-7 on synthetic peptides. 



Peptides cleaved 

Trp-Ala4-Gly-Gly-Asp-Ala-Ser-Gly-Glu 
(SEQ ID NO: 10) 

De-Ala4-Arg-Arg-His-Pro-Tyr-Phe-Leu 
(SEQIDNO:ll) 

Lys-De4-Ala-Gly-Tyr-His-Leu>Glu-Leu 
(SEQIDNO:12) 

Phe-Leu4-Arg-Glu-Pro-Val-He-Phe-Leu 
(SEQIDNO:13) 



Peptides not cleaved 



Trp-ffis-Trp-Leu-Glu-Leu-Lys-Pro-Gly- 
Glu-Pro-Met-Tyr (SEQ ID NO: 14) 
Ser-Pro-Tyr-Ser-Ser-Glu-Thr-Thr 
(SEQ ID NO: 15) 

Ala-Pro-Val-Arg-Ser-Leu (SEQ ID NO: 16) 

Gln-Lys-Gln-Met-Ser-Asp-Arg-Arg-Glu 
(SEQIDNO:17) 



An arrow indicates cleavage site 



EXAMPLE 6 

DPP-7 Sequence Analysis - Purified DPP-7 was resolved on SDS-PAGE 
and electroblotted onto a PVDF membrane. It had an amino-terminal sequence 

10 ADKGMMWLLNELNQENLDRMRELGFT (SEQ ID NO: 18). After 

proteolytic in-gel digestion of the enzyme additional internal sequences were 
obtained, including: DNKPYK (SEQ ID NO: 19), EMTYL (SEQ ID NO:20), 
FAQFAN (SEQ ID NO:21), VLPAML (SEQ ID NO:22), SVVPY (SEQ ID 
NO:23), LFFAGL (SEQ ID NO:24). All of this sequence data allowed us to 

15 identify the P. gingivalis genomic contig gin | TIGR I P. gingivalis^ in the 
Unfinished Microbial Genomes database, TIGR. An ORF corresponding to the 
DPP-7 amino acid sequence (SEQ ID NO:l) was found, as indicated by the fact, 
that all sequences of the DPP-7 derived peptides obtained by the enzyme 
polypeptide fragmentation by trypsin were present in the protein primary 

20 structure inferred from the nucleotide sequence of the ORF as shown in Fig. 4. 
Including a signal peptide (residues 1-24), the entire ORF corresponds to a 712 
amino acid polypeptide (see Fig. 4). Interestingly, the DPP-7 ORF contains the 
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consensus sequence for the active-site serine residue of serine type proteases, 
TGGNSGSPVF (SEQ ID NO:25). As indicated in Fig. 5 the DPP-7 carboxy- 
terminus (SEQ ID NO:3) exhibits high degree of identity to that of the V8 serine 
protease (SEQ ID NO:4), particularly around the putative active site serine 

5 residue. This is surprising since the P. gingivalis DPP-7 is a dipeptidylpeptidase 
specific for substrates having an aliphatic or an aromatic residue as a substituent 
on the a-carbon atom of the second amino acid from the N-terminal end of the 
substrate, whereas Staphyloccocus aureus V8 endopeptidase is specific towards 
substrates including glutamic acid or aspartic acid as the second amino acid from 

10 the N-terminal end of the substrate. The similarity search performed using the 
NCBI TBLASTN tool against GenBank, EMBL, DDBJ and PDB databases 
showed no significant similarity of DPP-7 to any other known 
dipeptidylpeptidases, indicating that this enzyme could be regarded as a member 
of a new family of proteases. Additional searches against databases containing 

15 unfinished and finished microbial genomes allowed us to identify more coding 
sequences encoding similar proteases with consensus active site sequence 
TGGNSGSPV (Fig. 6; SEQ ID NO:26). A coding sequence of related protein 
has been found in P. gingivalis W83 unfinished portion of complete genome 
between positions 1360759 and 1362718. This putative proteinase reveals 

20 significant similarity to DPP-7 (267/691 identities). Another organism 
Shewanella putrefaciens possesses two related coding sequences (gnl 
I TIGRJ24 1 sputre 6401 and gnl | TIGR_24 1 sputre 6410) while a plant 
pathogen Xylellafastidiosa contains one coding sequence encoding similar 
proteinase (gb | AE004008.1 1 ). In addition, the computer assisted search for 

25 sequential motifs characteristic for transmembrane domains revealed the 

presence of two such putative regions within the amino-terminal sequence of 
DPP-7, with residues 7 to 24 and 62 to 78 most likely folded into hydrophobic 
a-helices responsible for membrane anchoring of this enzyme. 

30 The complete disclosure of all patents, patent applications, and 

publications, and electronically available material (e.g., GenBank amino acid 
and nucleotide sequence submissions) cited herein are incorporated by reference. 
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What is claimed is: 

1. An isolated dipeptidylpeptidase, active analog, active fragment, or active 
modification thereof having amidolytic activity for cleavage of a peptide 
bond between the second and third amino acids from the N-terminal end 
of a target polypeptide, wherein the target polypeptide has an aliphatic or 
an aromatic residue as a substituent on the a-carbon atom of the second 
amino acid from the N-terminal end of the polypeptide. 

2. The dipeptidylpeptidase of claim 1 wherein the dipeptidylpeptidase is 
isolated from Porphyromonas gingivalis. 

3. The dipeptidylpeptidase of claim 1 wherein the dipeptidylpeptidase is a 
serine protease. 

4. The dipeptidylpeptidase of claim 1 comprising an amino acid sequence 
TGGNSGSPV (SEQ ID NO:26). 

5. The dipeptidylpeptidase of claim 1 comprising an amino acid sequence 
TGGNSGSPVF (SEQ ID NO:25). 

6. The dipeptidylpeptidase of claim 1 comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO:3, SEQ ID NO: 18, 
SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ 
ID NO:23, SEQ ID NO:24, SEQ ID NO:25, and SEQ ID NO:26. 

7. The dipeptidylpeptidase of claim 1 comprising an amino acid sequence 
SEQIDNO:2. 

8. The dipeptidylpeptidase of claim 1 wherein the dipeptidylpeptidase is 
encoded by a nucleic acid comprising a nucleotide sequence SEQ ID 
NO:l. 
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9. An isolated polypeptide comprising an amino acid sequence having a 
percentage amino acid identity greater than about 40% with SEQ ID 
NO:2. 

10. An isolated nucleic acid comprising a coding sequence encoding a 
dipeptidylpeptidase, active analog, active fragment, or active 
modification thereof having amidolytic activity for cleavage of a peptide 
bond between the second and third amino acids from the N-terminal end 
of a target polypeptide, wherein the target polypeptide has an aliphatic or 
an aromatic residue as a substituent on the a-carbon atom of the second 
amino acid from the N-terminal end of the polypeptide. 

11. The nucleic acid of claim 10 wherein the nucleic acid comprises a 
nucleotide sequence SEQ ID NO: 1 . 

12. The nucleic acid of claim 10 wherein a complement of the nucleic acid 
hybridizes to SEQ ID NO: 1 under hybridization conditions of 0.5 M 
phosphate buffer, pH 7.2, 7% SDS, 10 mM EDTA, at 68°C, followed by 
three for 20 minutes washes in 2x SSC, and 0.1% SDS, at 65°C, wherein 
at least about 20 nucleotides of the complement hybridize. 

13. An isolated nucleic acid encoding a polypeptide, wherein the polypeptide 
comprises an amino acid sequence having a percentage amino acid 
identity greater than about 40% with SEQ ID NO:2. 

14. A method of identifying an inhibitor of a dipeptidylpeptidase, active 
analog, active fragment, or active modification thereof, comprising 
identifying a compound that inhibits the amidolytic activity of the 
dipeptidylpeptidase by incubating the dipeptidylpeptidase with the 
compound under conditions that promote amidolytic activity of the 
dipeptidylpeptidase and determining if the amidolytic activity of the 
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dipeptidylpeptidase is inhibited relative to the amidolytic activity in the 
absence of the compound. 

15. A method of reducing growth of a bacterium comprising inhibiting a 

i 

dipeptidylpeptidase, active analog, active fragment, or active 
modification thereof, by contacting the dipeptidylpeptidase with an 
inhibitor of the dipeptidylpeptidase. 

16. The method of claim 15 wherein the dipeptidylpeptidase is a serine 
protease. 

17. A method for protecting an animal from a periodontal disease caused by 
Porphyromonas gingivalis comprising administering to the animal an 
inhibitor of dipeptidylpeptidase, wherein the disease is selected from the 
group consisting of gingivitis and periodontitis. 

18. The method of claim 17 wherein the inhibitor is administered by a 
method selected from the group consisting of subgingival application and 
controlled release delivery. 



19. An immunogenic composition comprising an isolated 
dipeptidylpeptidase, an antigenic analog, an antigenic fragment, or an 
antigenic modification thereof having amidolytic activity for cleavage of 
a peptide bond present in a target polypeptide, the peptide bond being 
located between the second and third amino acids from the N-terminal 
end of the target polypeptide, wherein the second amino acid from the N- 
terminal end of the polypeptide has an aliphatic or an aromatic residue as 
a substituent on the a-carbon atom. 

20. The composition of claim 19 wherein the dipeptidylpeptidase is a serine 
protease. 
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21. The composition of claim 19 wherein the second amino acid is selected 
from the group consisting of alanine, phenylalanine, isoleucine, and 
leucine. 

22. The immunogenic composition of claim 19 further comprising an 
adjuvant. 

23. A composition comprising an inhibitor of an isolated dipeptidylpeptidase 
and a pharmaceutically acceptable carrier. 
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4/7 

£?J7> t&Z/\__ atgcaaatgaaattaaaaagtattcttctcggagcagccctgctgttgggtgcttcaggggtagccaaagccgacaaaggcatgtggctc 90 

~ _ iT MQMKI.KSII.LG A * A LLLGASGVAK A P K G M W L 30 

ctcaacgaactcaatcaggagaatctggatcgaatgcgtgagctcggctttacgctcccgttggattcgctctacagtttcgacaagccg 180 

LHBLNQBHLPRMRBLGFT LPLDSLYSFDKP 60 

tccattgccaatgccgtggttatcttcggtggcggatgtaccggtatcacagtgtccgatcagggcctgatctttaccaaccaccactgc 270 

SIAHAVVI FGGGCTGITVSDQGLI FTMHHC 90 

ggatacggtgctatccagagccaaagcacggtggatcacgac tatctgcgcgatggtttcgtttctcgcacgatgggtgaggagcttccg 360 

GYGAXQSQ STVDHDYE'RDGFVSRTMGBBL P 120 

attccgggtctttccgtgaagtatctgcgcaagatcgtgaaggtaacggacaaggt^gaaggacagctcaagggtatcactgacgagatg 450 

IPGLSVKYLRKIVKVTDKVBGQLKGITDBM 150 

gagcgtc tgcgcaaagc tcaggaggtatgccaagaac tggccaaaaaagaaaatgcagacgagaaccaac tc tgcatcgtagagcct t tc 54 0 

ERLRKAQBVCQELAKKBWADBNQLCIVBP P 180 

tattccaacaacgaatacttcctcatcgtctacgatgtattcaaggacgttcgtatggtatttgctcctcccagctctgtaggtaagttc 630 

YSNNBYPL tVTDVFK DVRMVFAPP SSVGK P 210 

ggaggcgatacggacaactggatgtggccgcgtcacacgggcgac ttcagcgtat tccgcgtgtatgccggtgccgacaaccggccggcc 720 

GGDTDNWMWP RHTGDPSVFRVYAGADNRP A 240 

gaatacagcaaggacaataaacccfcataagcccgtttacttcgctgccgtatccatgeaaggctacaaggctgacgactatgccafcgacc 810 

E Y S K P N K P Y K PVYPAAVSMQGYKADDYAMT 270 

atcggtttcccgggcagtacggatcgctacctcacttcttggggtgtggaagatcgtatcgaaaacgagaacaatcctcgtatcgaagtt 900 

IGFPGSTDRYLTSWGVBDRIBNBNNPRIBV 300 

cgcggtatcaagcaaggcatctggaaggaagccatgagcgcagatcaggctacccgtatcaaatatgccagcaagtatgctcagagtgct 990 

RGIKQGIWKEAMSADQATRIKYASKYAQS A 330 

aactattggaagaattcgatcggtatgaaccgcggtctcgctcgtcttgacgtgataggtcgtaagcgtgccgaggaaagagcattcgca 1080 

NYWKNSI GMN RGLARLDVI GRKRABBRAF A 360 

gactggatccgtaagaacggcaagagtgctgtctatggcgatgtattgtcttctctcgaaaaggcttataaggaaggagccaaggccaac- 1170 

DWIRKNGKSAVYGDVLS SLE.KAYKEGAKAN 390 

cgtgagatgacttatttgagcgagacgctcttcggtggtaccgaggtggttcgttttgcacagtttgccaacgcattggctacaaatcct 1260 

R S M T Y I. S B TI# FGGTEVVR FAQ FAN A L A T N P 420 

gatgctcatgccggtatcctcaaatcgcttgacgacaagtacaaagactacctcccctcgctcgaccgtaaggtgctgcccgccatgctc 1350 

DA H AG XI. K SLDDKYKDYLP SLDRK V L P A M L 450 

gatattgtacgccggcgtatccctgccgacaagctccccgatatattcaagaatgtaatcgacaagaaattxiaaaggcgacacgaagaag 1440 

DIVRRRI PADKLP.DIFKNVIDKKPKGDTKK 480 

tatgcagacttcgtattcgacaagagtgtggttccttatagcgacaagttccatgccatgctcaagtccatggacaaggaaaagtttgcc 1530 

YADFVFDK S V V P Y SDKFHAMLKSMDKBKPA 510 

aaggctatcgagaaagatccggcagtagagctttccaagagcgtaatagctgctgctcgcgctattcaggccgatgcgatggccaatgcc 1620 

KAIEKDPAVBLSKSVIAAARAIQADAMANA 540 

tatgcca ttgagaagggcaagcgtc ttttc tttgccggtttgcgtgagatgtaccccggacgtgctc tgccgagcgatgccaacttcacc 17 10 

YAIBKGKR LP F A G L RBMYPGRALP SDANF T 570 

atgcgtatgagctacggctccatcaagggatatgaaccgeaggacggtgcctggtacaactatcatacgacaggcaagggcgtattggag 1800 

MRMSYGSIKGYBPQDGAWYNYHTXGKGVL B 600 

aagcaggatcctaagagcga tgagtttgccgtacaggagaatatcc tcgacctc ttccgcaccaaaaactatggtcgctatgccgagaac 18 9 0 

KQDPKSDBFAVQKMILD LFRTKNYGRYARN 630 

ggtcagc tccatatcgctttcc ta tcgaacaacgacatcacgggcggtaactccggtagccccgtattcgataagaacggccgtc tgatc 1980 

GQLHIAFLSNHDITGGN§GSPVPDKNGRL I 660 

ggtcttgctttcgatggcaactgggaagctatgagtggtgacatcgagttcgaacccgatctgcagcgcacaatcagcgtggacatccgc 2070 

GLAFDGNWEAMSGDIEFEPDLQRTISVDI R 690 

tacgttctcttcatgattgacaaatggggtcagtgcccccgtctcatccaagagctgaagttgatctaa 2139 

YVLFMIDKWGQCPRLIQBLKL1* 713 
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The foregoing detailed description and examples have been given for clarity of 
understanding only. No unnecessary limitations are to be understood therefrom. 
The invention is not limited to the exact details shown and described, for 
variations obvious to one skilled in the art will be included within the invention 
5 defined by the claims. 



SEQUENCE LISTING FREE TEXT 
SEQ ID NO: 1 Coding sequence encoding Porphyromonas gingivalis 

DPP-7 

10 SEQ ID NO:2 Porphyromonas gingivalis DPP-7 

SEQ ID NO:3 C-terminal region of Porphyromonas gingivalis DPP-7 

SEQ ID NO:4 C-terminal region of Staphylococcus aureus V8 

endopeptidase 

SEQ ID NO:5 Coding sequence for Shewanella putrefaciens gnl 

15 |TIGR24|sputre6401 

SEQ ID NO: 6 Coding sequence for Shewanella putrefaciens gnl 

|TIGR_24|sputre6410 

SEQ ID NO:7 Coding sequence for Xylellafastidiosa gb | AE004008. 1 

SEQ ID NO: 8 Coding sequence for Porphyromonas gingivalis gnl 

20 | TIGR | P. gingivalisJCPG.con 

SEQ ID NO: 9 Coding sequence for Porphyromonas gingivalis DPP-7 

gnl | TIGR | P. gingivalisjCPG.con 

SEQ ID NO: 10-17 Synthetic peptides 

SEQ ID NO: 18 N-terminal region of Porphyromonas gingivalis DPP-7 
25 SEQ ID NO: 19-24 Internal sequences of Porphyromonas gingivalis DPP-7 
SEQ ID NO:25-26 Consensus sequences for active sites for serine type 
proteases 
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SEQUENCE LISTING 



<110> THE UNIVERSITY OF GEORGIA RESEARCH FOUNDATION, INC. 



<120> Dipeptidyl peptidases And Methods of use 

<130> 235.00440201 

<150> US 60/246,827 

<151> 2000-11-08 

<160> 26 



<170> Patentin version 3.0 



<210> 1 

<211> 2139 

<212> DNA 

<213> Porphyromonas gingival is 



<400> 1 

atgcaaatga aattaaaaag tattcttctc ggagcagccc tgctgttggg tgcttcaggg 60 

gtagccaaag ccgacaaagg catgtggctc ctcaacgaac tcaatcagga gaatctggat 120 

cgaatgcgtg agctcggctt tacgctcccg ttggattcgc tctacagttt cgacaagccg 180. 

tccattgcca atgccgtggt tatcttcggt ggcggatgta ccggtatcac agtgtccgat 240 

cagggcctga tctttaccaa ccaccactgc ggatacggtg ctatccagag ccaaagcacg 300 

gtggatcacg actatctgcg cgatggtttc gtttctcgca cgatgggtga ggagcttccg 360 

attccgggtc tttccgtgaa gtatctgcgc aagatcgtga aggtaacgga caaggtagaa 420 

ggacagctca agggtatcac tgacgagatg gagcgtctgc gcaaagctca ggaggtatgc 480 

caagaactgg ccaaaaaaga aaatgcagac gagaaccaac tctgcatcgt agagcctttc 540 

tattccaaca acgaatactt cctcatcgtc tacgatgtat tcaaggacgt tcgtatggta 600 

tttgctcctc ccagctctgt aggtaagttc ggaggcgata cggacaactg gatgtggccg 660 

cgtcacacgg gcgacttcag cgtattccgc gtgtataccq. qtgccgacaa ccggccggcc 720 
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gaatacagca 


aggacaataa 


accctataag 


cccgtttact tcgctgccgt atccatgcaa 


780 


ggctacaagg 


ctgacgacta 


tgccatgacc 


atcggtttcc cgggcagtac ggatcgctac 


840 


ctcacttctt 


ggggtgtgga 


agatcgtatc 


gaaaacgaga acaatcctcg tatcgaagtt 


900 


cgcggtatca 


agcaaggcat 


ctggaaggaa 


gccatgagcg cagatcaggc tacccgtatc 


960 


aaatatgcca 


gcaagtatgc 


tcagagtgct 


aactattgga agaattcgat cggtatgaac 


1020 


cgcggtctcg 


ctcgtcttga 


cgtgataggt 


cgtaagcgtg ccgaggaaag agcattcgca 


1080 


gactggatcc 


gtaagaacgg 


caagagtgct 


gtctatggcg atgtattgtc ttctctcgaa 


1140 


aaggcttata 


aggaaggagc 


caaggccaac 


cgtgagatga cttatttgag cgagacgctc 


1200 


ttcggtggta 


ccgaggtggt 


tcgttttgca 


cagtttgcca acgcattggc tacaaatcct 


1260 


gatgctcatg 


ccggtatcct 


caaatcgctt 


gacgacaagt acaaagacta cctcccctcg 


1320 


ctcgaccgta 


aggtgctgcc 


cgccatgctc 


gatattgtac gccggcgtat ccctgccgac 


1380 


aagctccccg 


atatattcaa 


gaatgtaatc 


gacaagaaat tcaaaggcga cacgaagaag 


1440 


tatgcagact 


tcgtattcga 


caagagtgtg 


gttccttata gcgacaagtt ccatgccatg 


1500 


ctcaagtcca 


tggacaagga 


aaagtttgcc 


aaggctatcg agaaagatcc ggcagtagag 


1560 


ctttccaaga 


gcgtaatagc 


tgctgctcgc 


gctattcagg ccgatgcgat ggccaatgcc 


1620 


tatgccattg 


agaagggcaa 


gcgtcttttc 


tttgccggtt tgcgtgagat gtaccccgga 


1680 


cgtgctctgc 


cgagcgatgc 


caacttcacc 


atgcgtatga gctacggctc catcaaggga 


1740 


tatgaaccgc 


aggacggtgc 


ctggtacaac 


tatcatacga caggcaaggg cgtattggag 


1800 


aagcaggatc 


ctaagagcga 


tgagtttgcc 


gtacaggaga atatcctcga cctcttccgc 


1860 


accaaaaact 


atggtcgcta 


tgccgagaac 


ggtcagctcc atatcgcttt cctatcgaac 


1920 


aacgacatca 


cgggcggtaa 


ctccggtagc 


cccgtattcg ataagaacgg ccgtctgatc 


1980 


ggtcttgctt 


tcgatggcaa 


ctgggaagct 


atgagtggtg acatcgagtt cgaacccgat 


2040 


ctgcagcgca 


caatcagcgt 


ggacatccgc 


tacgttctct tcatgattga caaatggggt 


2100 


cagtgccccc 


gtctcatcca 


agagctgaag 


ttgatctaa 


2139 


<210> 2 










<211> 712 










<212> PRT 










<213> Porphyromonas gingival is 







<400> 2 

Met Gin Met Lys Leu Lys Ser lie Leu Leu Gly Ala Ala Leu Leu Leu 
15 10 15 
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Gly Ala ser Gly val Ala Lys Ala Asp Lys Gly Met Trp Leu Leu Asn 
20 25 30 

Glu Leu Asn Gin Glu Asn Leu Asp Arg Met Arg Glu Leu Gly Phe Thr 
35 40 45 

Leu Pro Leu Asp ser Leu Tyr ser Phe Asp Lys Pro Ser lie Ala Asn 
50 55 60 

Ala val val lie Phe Gly Gly Gly Cys Thr Gly lie Thr val Ser Asp 
65 70 75 80 

Gin Gly Leu lie Phe Thr Asn His His cys Gly Tyr Gly Ala He Gin 
85 90 95 

ser Gin Ser Thr val Asp His Asp Tyr Leu Arg Asp Gly Phe Val Ser 
100 105 110 

Arg Thr Met Gly Glu Glu Leu Pro lie Pro Gly Leu Ser val Lys Tyr 
115 120 125 

Leu Arg Lys He val Lys Val Thr Asp Lys val Glu Gly Gin Leu Lys 
130 135 140 

Gly lie Thr Asp Glu Met Glu Arg Leu Arg Lys Ala Gin Glu Val Cys 
145 150 155 160 

Gin Glu Leu Ala Lys Lys Glu Asn Ala Asp Glu Asn Gin Leu cys lie 
165 170 175 

Val Glu Pro Phe Tyr Ser Asn Asn Glu Tyr Phe Leu lie Val Tyr Asp 
180 185 190 

Val Phe Lys Asp Val Arg Met Val Phe Ala Pro Pro Ser ser val Gly 
195 200 205 

Lys Phe Gly Gly Asp Thr Asp Asn Trp Met Trp Pro Arg His Thr Gly 
210 215 220 

Asp Phe ser Val Phe Arg Val Tyr Ala Gly Ala Asp Asn Arg Pro Ala 
225 230 235 240 

Glu Tyr ser Lys Asp Asn Lys pro Tyr Lys pro Val Tyr Phe Ala Ala 
245 250 255 

val ser Met Gin Gly Tyr Lys Ala Asp Asp Tyr Ala Met Thr lie Gly 
260 265 270 

Phe Pro Gly ser Thr Asp Arg Tyr Leu Thr ser Trp Gly Val Glu Asp 
275 280 285 

Arg lie Glu Asn Glu Asn Asn Pro Arg lie Glu Val Arg Gly lie Lys 
290 295 300 

Gin Gly lie Trp Lys Glu Ala Met ser Ala Asp Gin Ala Thr Arg lie 
305 310 315 320 

Lys Tyr Ala Ser Lys Tyr Ala Gin Ser Ala Asn Tyr Trp Lys Asn Ser 
325 330 335 

lie Gly Met Asn Arg Gly Leu Ala Arg Leu Asp val lie Gly Arg Lys 



340 



345 



350 



4/22 



WO 02/38742 PCT/US01/46782 



Arg Ala Glu Glu Arg Ala Phe Ala Asp Trp lie Arg Lys Asn Gly Lys 
355 " 360 365 

Ser Ala val Tyr Gly Asp Val Leu Ser ser Leu Glu Lys Ala Tyr Lys 
370 375 380 

Glu Gly Ala Lys Ala Asn Arg Glu Met Thr Tyr Leu Ser Glu Thr Leu 
385 390 395 400 

Phe Gly Gly Thr Glu Val Val Arg Phe Ala Gin Phe Ala Asn Ala Leu 
405 410 415 

Ala Thr Asn Pro Asp Ala His Ala Gly lie Leu Lys ser Leu Asp Asp 
420 425 430 

Lys Tyr Lys Asp Tyr Leu Pro ser Leu Asp Arg Lys Val Leu Pro Ala 
435 440 445 

Met Leu Asp lie Val Arg Arg Arg lie Pro Ala Asp Lys Leu Pro Asp 
450 455 460 

lie Phe Lys Asn val lie Asp Lys Lys Phe Lys Gly Asp Thr Lys Lys 
465 470 475 480 

Tyr Ala Asp Phe Val Phe Asp Lys Ser val val Pro Tyr ser Asp Lys 
485 490 495 

Phe His Ala Met Leu Lys ser Met Asp Lys Glu Lys Phe Ala Lys Ala 
500 505 510 

lie Glu Lys Asp Pro Ala val Glu Leu Ser Lys Ser val lie Ala Ala 
515 520 525 

Ala Arg Ala lie Gin Ala Asp Ala Met Ala Asn Ala Tyr Ala lie Glu 
530 535 540 

Lys Gly Lys Arg Leu Phe Phe Ala Gly Leu Arg Glu Met Tyr Pro Gly 
545 550 555 560 

Arg Ala Leu Pro ser Asp Ala Asn Phe Thr Met Arg Met ser Tyr Gly 
565 570 575 

Ser lie Lys Gly Tyr Glu Pro Gin Asp Gly Ala Trp Tyr Asn Tyr His 
580 585 590 

Thr Thr Gly Lys Gly Val Leu Glu Lys Gin Asp Pro Lys ser Asp Glu 
595 600 605 

Phe Ala val Gin Glu Asn lie Leu Asp Leu Phe Arg Thr Lys Asn Tyr 
610 615 -620 

Gly Arg Tyr Ala Glu Asn Gly Gin Leu His He Ala Phe Leu Ser Asn 
625 630 635 640 

Asn Asp lie Thr Gly Gly Asn Ser Gly Ser Pro Val phe Asp Lys Asn 
645 650 655 

Gly Arg Leu lie Gly Leu Ala Phe Asp Gly Asn Trp Glu Ala Met ser 
660 665 670 

Gly Asp lie Glu Phe Glu Pro Asp Leu Gin Arg Thr lie Ser Val Asp 
675 680 685 

lie Arg Tyr Val Leu Phe Met He Asp Lys. Trp. Gly Gin cys Pro Arg 
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690 695 700 

Leu lie Gin Glu Leu Lys Leu lie 
705 710 

<210> 3 

<211> 52 

<212> PRT 

<213> Porphyromonas gingival is 

<400> 3 

Thr Gly Gly Asn Ser Gly ser pro val Phe Asp Lys Asn Gly Arg Leu 
1 5 10 15 

He Gly Leu Ala Phe Asp Gly Asn Trp Glu Ala Met ser Gly Asp lie 
20 25 30 

Glu Phe Glu Pro Asp Leu Gin Arg Thr lie Ser Val Asp He Arq Tyr 
35 40 45 

Val Leu Phe Met 
50 

<210> 4 
<211> 52 
<212> PRT 

<213> Staphylococcus amreus 
<400> 4 

Thr Gly Gly Asn Ser Gly Ser Pro Val Phe Asn Glu Lys Asn Glu Val 
15 10 15 

He Gly lie His Trp Gly Gly Val Pro Asn Glu Phe Asn Gly Ala Val 
20 25 30 

Phe lie Asn Glu Asn Val Arg Asn Phe Leu Lys Gin Asn lie Glu Asp 
35 40 45 

lie His Phe Ala 
50 

<210> 5 
<211> 734 
<212> PRT 

<213> Shewanella putrefaciens 
<400> 5 



6/22 





WO 02/38742 



PCT/US01/46782 



Met Ala ser Gin Ala Leu Gly Phe Leu His Gin Asn Gly Leu Asn Thr 
1 5 10 15 

Met Lys Lys Trp Leu Leu ser val Ala Val Ala Ala ser Phe Ala Ser 
20 25 30 

His Ala Asp Glu Gly Met Trp Gin Pro His Gin Leu Pro Ala Met Ala 
35 40 45 

Asp val Leu Lys Ala Lys Gly Leu Glu lie Asp Ala Lys Ser He ser 
50 55 60 

Lys Leu Thr Glu Phe Pro Met Asn Ala val He ser Leu Gly Gly cys 
65 70 75 80 

Thr Ala Ser phe Val Ser Pro Lys Gly Leu val Val Thr Asn His His 
85 90 95 

Cys Ala Tyr Gly ser lie Gin Tyr Asn ser Thr Pro Glu Lys Asn Leu 
100 105 110 

Leu Gin Asp Gly Phe Leu Ala Lys Thr Phe Ala Asp Glu Leu Pro Ala 
115 120 125 

Ala Pro Gly Ser Arg Val Tyr Val Thr Glu Asp Val Thr Asn val Thr 
130 135 140 

Glu Arg val Lys Ala Gly Leu Glu Asn Lys Thr Gly Arg Glu Phe Tyr 
145 150 155 160 

Gin Gly val Glu Asn Gin Glu Lys Ala Leu Val Ala Glu Cys Glu Lys 
165 170 175 

Asp Glu Gly Tyr Arg Cys Gin val Tyr Ser Phe His Gly Gly Leu Glu 
180 185 190 

Tyr Tyr Leu Val Lys Gin Leu Glu lie Arg Asp val Arg Leu Val Tyr 
195 200 205 

Asn Pro Ala Gly Ser val Gly Lys Tyr Gly Gly Asp Val Asp Asn Trp 
210 215 220 

Met Trp Pro Arg His Thr Gly Asp Tyr Ser Phe Tyr Arg Ala Tyr Val 
225 230 235 240 

Ser Lys Asn Gly Lys Pro Ala Glu Phe Ser Ala Asp Asn Val pro Tyr 
245 250 255 

Glu Pro Lys Ser Phe Leu Lys val Ser Ala Lys Gly val Ser Glu Gly 
260 265 270 

Asp Phe Val Met Val Ala Gly Tyr Pro Gly Arg Thr Asn Arg Tyr Arg 
275 280 285 

rhr Ala Thr Glu Val Gin Asn Glu Phe Glu Trp Ala Tyr Pro Glu Gly 
290 295 300 

jys Met Leu Arg Glu Arg Phe lie Glu lie lie Lys Ala Thr Ala Pro 
105 310 315 320 

ilu Gly Ser Asp Glu Arg lie Lys Tyr Glu ser Gin lie Ala Gly Leu 



330 
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Ala Asn Tyr Ala Lys Asn Phe Thr Ser Met lie Glu phe Tyr Gly Lys 
340 345 350 

Ser Thr Met Leu Ala Asp Arg Lys Ala Leu Glu Ala Lys Leu Ala Glu 
355 360 365 

Trp He Ala Lys Asp Ser ser Arg Glu Ala Lys Tyr Gly Lys Thr Leu 
370 375 380 

Ala Glu Leu Asp Ala Leu lie Ala Lys Ser Lys Ala His Gin Glu Arg 
385 . 390 395 400 

Asp Met He Leu Ser Tyr lie Ser Ser Thr Thr Met Leu Pro Thr Ala 
405 410 415 

Asn Asn Leu Tyr Arg Leu Ala His Glu Lys Gin Leu Pro Asp Met Gin 
420 425 430 

Arg Glu Pro Gly Phe Gin Asp Arg Asp Met Thr Arg Phe Lys Ala ser 
435 440 445 

Met Glu Arg lie Asp Arg Arg Tyr Ala Ala Ser Val Asp Lys Ala Val 
450 455 460 

Leu Phe Asp Met Leu Lys Arg Tyr Ala Ala Leu Pro Glu Ala Gin An 
465 470 475 48 

Leu pro Ala Met Asp Lys Ala Phe Gly lie Asp Asn Lys Val Asn Glu 
485 490 495 

Ala Lys Leu Ala Lys Thr Leu Asp Lys Met Tyr Ala Lys Thr Glu Leu 
500 505 510 

Gly Asn Lys Asp Val Arg Leu Ala Trp Met Glu Lys Ser Val Asp Asp 
515 520 525 

Phe Lys Ala ser Lys Asp Pro Phe lie Gin Phe Ala Val Ala Met Tyr 
530 535 540 

Asp Thr Asn Met ser Glu Glu Lys Lys Glu Lys Glu Leu Asp Gly Glu 
545 550 555 560 

Leu Met Lys val Arg Pro Gin Tyr Met Asp Ala lie lie Ala Tyr Asn 
565 570 575 

Leu Glu Gin Gly Lys Pro val Tyr Ala Asp Ala Asn Ser Ser Leu Arg 
580 585 590 

Val Thr val Gly His Val Lys Gly Tyr Ser Pro Lys Asp Gly Leu val 
595 600 605 

Ala Val Pro Phe Thr Arg Leu Glu Gly He Val Gin Lys Asp Thr Gly 
610 615 620 

lie Asp Pro Phe Asp Ala Pro Lys Gin Gin Leu Glu Leu lie Lys Gin 
625 630 635 640 

Lys Gin Tyr Gly Asp Phe Tyr Met Lys Ser lie Asp ser Val Pro val 
645 650 655 

Asn Phe Leu ser Thr Leu Asp Thr Thr Gly Gly Asn Ser Gly Ser Pro 
660 665 670 

Thr Leu Asn Gly Arg Ala Glu Leu val GlyJLeu Leu Phe Asp Gly Val 
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675 680 685 

Tyr Glu Ser He lie Gly Gly Trp Ala Phe Asp Asn Glu He Asn Arg 
690 695 700 

Ser lie His val Asp Ser Arg Tyr Met Leu Trp val Met Lys Tyr Leu 
705 710 715 720 

Asp His Ala Asp Asn Leu Leu Ala Glu Met Glu lie val Asn 
725 730 

<210> 6 
<211> 732 
<212> PRT 

<213> shewanella putrefaciens 



<400> 6 

Met Arg lie Ala Leu Val Ala Ala Leu Val Leu Thr Cys Gly He Ala 
1 5 10 15 

Thr Ala Asp Glu Gly Gin Trp Gin pro Tyr Gin Met Pro ser lie Ala 
20 25 30 

Asp Lys Leu Ser Ala Arg Gly lie Asp lie Pro Ala Asp Lys Leu Ala 
35 40 45 

Asp Leu Thr Ser Tyr Pro Met Asn Ala val Val Gly Leu Gly Tyr cys 
50 55 60 

Thr Ala ser Phe val Ser Pro Gin Gly Leu val Val Thr Asn His His 
65 70 75 80 

Cys Ala Tyr Lys Ala lie Gin Tyr Asn Thr Lys Lys Glu His Asn Tyr 
85 90 95 

Leu Glu Gin Gly Phe Leu Ala Thr ser Met Asp Lys Glu Pro Ser Ala 
100 105 110 

Gly Pro Asn Glu Arg Leu Tyr lie Thr Glu Ala Val Thr Asp val Thr 
115 120 125 

ser Asp val Thr Lys Asp Leu Ser Gin Asp Pro Leu Lys Arg Tyr Glu 
130 K 135 140 

Glu lie Glu Asn His Ser Lys Ala Leu lie Lys Ser Cys Glu Ala Asp 
145 150 155 160 

Asp Asn Tyr Arg cys Asn val Arg Ser Phe His Asn Gly Leu Glu Tyr 
165 " 170 175 

Tyr Leu lie Lys Gin Leu Met lie Arg Asp Val Arg Leu Val Tyr Ala 
180 185 190 

Pro Pro Glu Ser Val Gly Gly Tyr Gly Gly Asp lie Asp Asn Tyr Glu 
195 200 205 

Tyr Pro Arg His Ser Gly Asp Phe Ala Phe Leu Arg Ala Tyr Val Gly 
210 215 .220 
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Lys Asp Gly Lys Pro Ala Ala Phe Ser Glu Asp Asn He Pro Tyr Thr 
225 230 235 240 

Pro Lys ser Tyr Leu Lys Val Asn Ala Asp Gly Val Lys Ala Gly Asp 
245 250 255 

Gly val Phe val Ala Gly Tyr Pro Gly Thr Thr Asn Arg Tyr Asn Leu 
260 ' 265 270 

Thr ser Glu Leu Lys Phe Ala Ser Asp Trp Leu Tyr Pro Thr Gin Ala 
275 280 285 

Lys Arg Tyr Gin Leu Gin lie Asp Thr lie Glu Ala Met Gly Gin Lys 
290 295 300 

Asp Ala Asp lie Ala lie Lys Tyr Ala Gly Asn Met Ala ser Met Ala 
305 310 315 320 

Asn Arg Met Lys Lys Leu Asn Gly Leu Leu Ala Gly Phe Lys Ala Thr 
325 330 335 

Asp lie Val Gly lie Lys Gin Gin Arg Glu Asn Asp Phe Leu Ala Trp 
340 345 350 

Leu Thr Lys Asn Pro Asn Leu Asn Gin Asn Leu lie Ser Glu Leu Glu 
355 360 365 

Val Leu Leu Ala Glu Gin Gin Leu Gin Thr Gin Thr Asn Tyr Tyr Phe 
370 375 380 

Thr Asn Ala Gin Ser ser Thr Leu Leu Thr Ala Ala Asn Asn Leu Tyr 
385 390 395 400 

Arg Leu Ala Lys Glu Lys Gin Lys ser Asp Ala Glu Arg Glu lie Gly 
405 410 415 

Tyr Gin Glu Arg Asp Leu Ala Met Phe Ser Ser Arg Leu Lys Arg lie 
420 425 430 

Asp ser Ser Phe Asp Val Lys Val Asp Lys Thr Leu Trp Leu Gin Asp 
435 440 445 

Leu Asn Ala Tyr Leu ser Gin pro Asn Arg val Ala Ala Leu Asp Asn 
450 455 460 

Met Leu Asn Leu Asn Asp Lys Asn Val ser Leu Ala Ala Lys Leu Asp 
465 470 475 480 

Gly Leu Tyr ser Leu Thr Thr Leu Thr Asp Gin Ala Gin Arg Leu Ala 
485 490 495 

Trp Met Glu Ala Asp Ala Lys Ala Phe Glu Thr Ser Ser Asp Pro Phe 
500 505 510 

lie Arg Leu Ala Val Ala Leu Tyr Asp Thr Asn Met Ala Gin Glu Lys 
515 520 525 

Ala Glu Lys lie Leu Ala Gly Lys Leu ser Thr Ala Arg Pro Ala Tyr 
530 535 540 

Met Ala Ala Val lie Asp Tyr Tyr Lys Ala Asn Asn Trp Pro Val Tyr 



545 



550 



555 



560 
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Pro. Asp Ala Asn Gly Thr Leu Arg He Ser Tyr Gly Met val Asp Gly 
565 570 575 

Tyr Gin Ser Arg Asp Ala Leu Tyr Lys Gin Pro Phe Thr Arg Leu Asp 
580 585 590 

Gly lie val Ala Lys His Thr Gly val Glu Pro Tyr Asn Ala Pro Lys 
595 600 605 

Lys Leu Leu Asp Ala lie ser Val Gin Arg Phe Gly Asp His Leu val 
610 615 620 

Lys ser Val Tyr Gin Asp pro Arg Gly Trp lie Cys Arg Leu Phe Ser 
625 630 635 640 

Cys Leu Asp Lys Pro Glu Glu Phe Asn ser Val Pro val Asn Phe Leu 
645 650 655 

ser ser Val Asp Thr Thr Gly Gly Asn ser Gly Ser Pro val Phe Asn 
660 665 670 

Gly Lys Gly Glu Leu Val Gly Leu Asn Phe Asp ser Thr Tyr Glu Ala 
675 680 685 

lie Thr Lys Asp Trp Phe Phe Asn Pro Thr lie Thr Arg Ala val His 
690 695 700 

val Asp He Arg Tyr lie Leu Trp Met Met Asp Glu Val Asp His Ala 
705 710 715 720 

Asp Asn Leu lie Lys Glu Leu Asp Leu val Arg Asn 
725 730 

<210> 7 
<211> 716 
<212> PRT 

<213> xylella fastidiosa 



<400> 7 

Met Arg Phe Asn Leu Leu ser Leu Ser Val Leu Ala Thr Leu lie Thr 
15 10 15 

Val Asp Ser Thr His Ala Gly Glu Gly Met Trp val Pro Gin Gin Leu 
20 25 30 

pro Glu lie Ala Gly Pro Leu Lys Gin Ala Gly Leu Gin Leu Ser Pro 
35 40 45 

Glu Gin Leu ser Asn Leu Thr Gly Asp Pro Met Gly Ala Val Val ser 
50 55 60 

Leu Gly Asn cys Thr Ala Ser Leu val ser Pro Glu Gly Leu Val lie 
65 70 75 • 80 

Thr Asn His His Cys Ala Tyr Gly Ala He Gin Leu Asn Ser Thr Pro 
85 90 95 

Lys Lys Asn Leu He Lys Glu Gly Phe Asn. Ala Leu Thr Gin Ala Asp 
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100 



105 



110 



Glu val ser Ala Gly Pro Asn Ala Arg He Tyr val Leu Glu Gin lie 
115 120 125 

Thr Asp Val Thr Ala Gin Ala Lys Ala Ala Leu Ala Ala Ala Gly Asn 
130 135 140 

Asp Pro Phe Lys Arg Thr Thr Ala Leu Glu Thr Phe ser Lys Gin Glu 
145 150 155 160 

lie Ala Lys Cys Glu Glu Glu Gin Gly Tyr Arg Cys Gin Phe Phe Ser 
165 170 175 

Phe Ala Gly Gly Asn Thr Tyr Arg Val Phe Lys Asn Leu Glu lie Lys 
180 185 190 

Asp val Arg Leu val Tyr Ala Pro Gin Gly Ser val Gly Lys phe Gly 
195 200 205 

Gly Asp val Asp Asn Trp Met Trp Pro Arg His Thr Gly Asp phe ser 
210 215 220 

Phe Tyr Arg Ala Tyr val Gly Lys Asp Gly Lys Pro Ala Ser phe ser 
225 230 235 240 

Lys Glu Asn lie Pro Tyr Arg Pro Lys His Trp Leu Lys Phe ser Asp 
245 " 250 255 

Gin Pro Leu Gly Asp Gly Asp Phe val Met val Ala Gly Tyr pro Gly 
260 265 270 

Arg Thr Asn Arg Tyr Ala Leu Val Ala Glu Phe Glu Asn Thr Ala His 
275 280 285 

Trp Thr Tyr Pro val lie Gly Gin His phe Lys Asn Leu lie Ala Leu 
290 295 300 

lie Glu Ala Ala Ser Lys Gin Asn Pro Asp lie Gin val Lys Tyr Ala 
305 310 315 320 

Ser Thr Leu Ala Gly Leu Asn Asn Thr ser Lys Asn Phe Asp Gly Gin 
325 330 335 

Leu Asp Gly Phe Arg Arg lie Asn Ala lie Gly Gin Lys Gin Ser Glu 
340 345 350 

Glu Thr Ala Val Leu Ala Trp Leu Lys Gin Gin Gly lie Arg Gly His 
355 360 365 

Glu Ala Leu Ala Ala His Gin Thr Leu Val Asp Leu Thr Glu Gin Tyr 
370 375 380 

Lys Ala Asn Gin Asp Arg Asp Phe val Leu Gly Gin Phe Asn Gly ser 
385 390 395 400 

Gly val lie Gly val Ala Val Asn Leu Tyr Arg Leu Ala lie Glu Arg 
405 410 415 

Thr Lys Ser Asp Ala Gin Arg Glu Ala Gly Tyr Gin Glu Arg Asp Leu 
420 425 430 

Pro Thr lie Glu Gly Asn Leu Lys Gin Met Glu Arg Arg Tyr Leu Pro 



435 



440 
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Glu Met Asp Arg Gin Met Gin Gin Tyr Trp Leu Thr Glu Tyr Asn Lys 
450 455 460 

Leu Pro val Lys Gin Arg Val Ala Ala lie Asp val Trp Leu Gly Asp 
465 470 475 480 

Gly He Pro Ala Thr Leu Lys Arg Leu Gly Asp Thr Lys Leu Ser ser 
485 490 495 

Ser Glu Glu Arg Leu Lys Trp Phe Asn Ala Asp Arg Ala Ala Phe Glu 
500 505 510 

Ser Ser Gin Asp Pro Ala lie Arg Tyr Ala Val Ala lie Met Pro Ala 
515 520 525 

Leu Leu Glu lie Glu Arg Gin Asn Lys lie Arg Thr Gly Glu Leu Leu 
530 535 540 

Lys Ala Arg Pro lie Tyr Leu Gin Ala Leu Ala Asp Tyr Asn Lys Ser 
545 550 555 560 

His Gly Lys phe Val Tyr Pro Asp Ala Asn Ser ser Leu Arg lie Thr 
565 570 575 

Phe Gly His Val Lys Gly Tyr Ser Pro Lys Asp Gly Val Glu Tyr Thr 
580 585 590 

Pro Phe Thr Thr Leu Gin Gly Val Met Ala Lys Asn Thr Gly Val Glu 
595 600 605 

Pro Phe Asp Ser pro Lys Ser Leu lie Asn Ala lie Lys Ala Lys Ser 
610 615 620 

Tyr Ala Asn Leu Ala Asp Gin Arg lie Gly Thr Val Pro Val Asn Phe 
625 630 635 640 

Leu Ser Asp Leu Asp lie Thr Gly Gly Asn ser Gly Ser Pro Val Leu 
645 650 655 

Asp Ala His Gly Lys Leu Val Gly Leu Ala Phe Asp Gly Asn Trp Glu 
660 665 670 

Ser Val ser Ser Asn Trp Val Phe Asp Pro Val Met Thr Arg Thr lie 
675 680 685 

Ala val Asp Ser Arg Tyr val Gin Trp lie Met Thr Glu Val Ala Pro 
690 695 700 

Ala Pro His Leu Leu Lys Glu Leu Asn Leu Tyr Arg 
705 710 715 

<210> 8 

<211> 699 

<212> PRT 

<213> Porphyromonas gingival is 



<400> 8 
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10 



20 



25 



35 



40 



50 



65 



70 



75 



85 



90 



100 



105 



115 



120 



130 



135 



145 



150 



155 



165 



170 



180 



185 



195 



200 



210 215 " "" 

Phe Ser val Phe Arg val Tyr Ala Gly Ala 

230 235 

Tyr ser Lys Asp Asn Lys Pro Tyr Lys Pro 

245 250 



290 



295 



305 



310 



315 



Ala Ala 


Leu 


Leu 

IS 


Leu 


Met Trp 


Leu 
30 


Leu 


Asn 


Glu Leu 
45 


Gly 


Phe 


Thr 


Pro Ser 


He 


Ala 


Asn 


lie Thr 


val 


Ser 


Asp 


Tyr Gly 


Ala 


He 
95 


Gin 


Asp Gly 


Phe 
110 


Val 


Ser 


Leu ser 


val 


Lys 


Tyr 


Glu Gly 
140 


Gin 


Leu 


Lys 


Ala Gin 


Glu 


val 


Cys 

-LOU 


Asn Gin 


Leu 


cys 

J./ D 


He 


Leu lie 


Val 
190 


Tyr 


Asp 


Pro. Ser 
205 


Ser 


val 


Gly 


Pro Arg 

220 


His 


Thr 


Gly 


Asp Asn 


Arg 


Pro 


Ala 
240 


val Tyr 


Phe 


Ala 
255 


Ala 


Ala Met 


Thr 
270 


lie 


Gly 


Trp Gly 
285 


Val 


Glu 


Asp 


Val Arg 
300 


Gly 


He 


Lys 


Gin Ala 


Thr 


Arg 


He 
320 


Tyr Trp 


Lys 


Asn 
335 


Ser 


Val He 


Gly 


Arg 


Lys 
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340 



345 



350 



Arg Ala Glu Glu Arg Ala Phe Ala Asp Trp lie Arg Lys Asn Gly Lys 
355 ~ 360 365 

ser Ala Val Tyr Gly Asp Val Leu Ser ser Leu Glu Lys Ala Tyr Lys 
370 375 380 

Glu Gly Ala Lys Ala Asn Arg Glu Met Thr Tyr Leu Ser Glu Thr Leu 
385 390 395 400 

Phe Gly Gly Thr Glu Val Val Arg Phe Ala Gin Phe Ala Asn Ala Leu 
405 ~ 410 415 

Ala Thr Asn Pro Asp Ala His Ala Gly lie Leu Lys ser Leu Asp Asp 
420 425 430 

Lys Tyr Lys Asp Tyr Leu Pro Ser Leu Asp Arg Lys Val Leu Pro Ala 
435 440 445 

Met Leu Asp lie val Arg Arg Arg lie Pro Ala Asp Lys Leu Pro Asp 
450 455 460 

He Phe Lys Asn Val He Asp Lys Lys Phe Lys Gly Asp Thr Lys Lys 
465 470 475 480 

Tyr Ala Asp Phe Val Phe Asp Lys ser val Val Pro Tyr ser Asp Lys 
485 490 495 

Phe His Ala Met Leu Lys Ser Met Asp Lys Glu Lys Phe Ala Lys Ala 
500 505 510 

lie Glu Lys Asp Pro Ala Val Glu Leu ser Lys ser Val lie Ala Ala 
515 520 525 

Ala Arg Ala lie Gin Ala Asp Ala Met Ala Asn Ala Tyr Ala lie Glu 
530 535 540 

Lys Gly Lys Arg Leu Phe Phe Ala Gly Leu Arg Glu Met Tyr Pro Gly 
545 550 555 560 

Arg Ala Leu Pro ser Asp Ala Asn Phe Thr Met Arg Met Ser Tyr Gly 
565 570 575 

ser lie Lys Gly Tyr Glu Pro Gin Asp Gly Ala Trp Tyr Asn Tyr His 
580 585 590 

Thr Thr Gly Lys Gly val Leu Glu Lys Gin Asp Pro Lys Ser Asp Glu 
595 600 605 

Phe Ala val Gin Glu Asn lie Leu Asp Leu Phe Arg Thr Lys Asn Tyr 
610 615 620 

Gly Arg Tyr Ala Glu Asn Gly Gin Leu His lie Ala Phe Leu ser Asn 
625 630 635 640 

Asn Asp He Thr Gly Gly Asn Ser Gly Ser Pro val Phe Asp Lys Asn 
645 650 655 

Gly Arg Leu lie Gly Leu Ala Phe Asp Gly Asn Trp Glu Ala Met ser 
660 665 670 

Gly Asp lie Glu Phe Glu Pro Asp Leu Gin Arg Thr lie Ser Val Asp 
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lie Arg Tyr Val, Leu Phe Met lie Asp Lys Trp 
690 695 

<210> 9 

<211> 720 

<212> PRT 

<213> Porphyromonas gingival is 
<400> 9 

Met Lys Lys Arg Leu Leu Leu Pro Leu Phe Ala val Leu Cys Leu cys 

Gin lie Ala His Ala Asp Glu Gly Met Trp Leu Met Gin Gin Leu Gly 
20 25 30 

Arg Lys Tyr Ala Gin Met Lys Glu Arg Gly Leu Lys Met Lys Glu Tyr 
. 35 40 45 

Asp Leu Tyr Asn Pro Asn Gly Thr ser Leu Lys Asp Ala val val Leu 
50 55 60 

Phe Asp Gly Gly Cys Thr Gly Glu val Val Ser Asp Arg Gly Leu val 
65 70 75 80 

Leu Thr Asn His Hi.s cys Gly Tyr Asp Met lie Gin Ala His ser Thr 
85 90 95 

Leu Glu His Asn Tyr Leu Glu Asn Gly Phe Trp Ala Met Arg Glu Ala 
100 105 HO 

Asp Glu Leu Pro Asn Lys Asp lie Ser val Val Phe lie Asp Lys lie 
115 120 125 

Glu Asp val Thr Asp Tyr val Lys Lys Asp Leu Lys Ala lie Lys Asp 
130 135 140 

Pro Asn ser Met Asp Tyr Leu ser pro Lys Tyr Leu Gin Lys Leu Ala 
145 150 155 160 

Asp Lys Lys Ala Gly Lys Asn Phe ser Ala Lys Asn Pro Gly Leu Ser 
165 170 175 

val Glu lie Lys Ala Phe Tyr Gly Gly Asn Leu Tyr Leu Met Phe Thr 
180 185 190 

Lys Lys Thr Tyr Thr Asp val Arg Leu val Gly Ala Pro Pro Thr ser 
195 200 205 

lie Gly Lys Phe Gly Ala Asp Thr Asp Asn Trp He Trp Pro Arg His 
210 215 220 

Thr Gly Asp Phe Ser He Phe Arg lie Tyr Ala Asp Lys Asn Gly Asn 
225 230 235 240 

pro Ala Pro Tyr Ser Glu Asp Asn val Pro Leu Lys Pro Lys Arg Phe 
245 250 255 
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Phe Asn 

Met Gly 

Asp Glu 
290 

lie Arg 
305 

Lys lie 

Arg Ala 

Asn Lys 

Gly Thr 
370 

Ala Lys 
385 

lie lie 

Glu Thr 

lie Asp 

ser Ala 
450 

Leu Lys 
465 

Lys Asp 
Ala Arg 
val pro 



ser ser 
530 

Tyr Asp 
545 



lie ser 
260 

Phe Pro 

275 

Trp Lys 

Gin Gly 

Met Tyr 

lie Gly 
340 

Gin Ala 
355 

Pro Arg 

Arg Ala 

Arg Gly 

Lys Ala 
420 

Lys lie 
435 

Glu Val 

Glu He 

Arg Phe 

Ser Val 
500 

Ser Val 
515 

Val Phe 
Asp Pro 
Met Asp 



Leu Gly Gly Val 
Gly Thr 
ser lie 



Thr His 

280 



Leu Glu 
Leu Arg 
Val Tyr Tyr Gly 



Phe Thr 
580 



Val Met 
310 

ser Ala 
325 

Ala Asn 

Met Gin 

Tyr Glu 

Asp Leu 
390 

He Glu 
405 

Leu Gin 

Arg Thr 

Asp Lys 

Pro Tyr 
470 

Ala Gly 
485 

Phe Gly 
Glu Lys 
Asp Glu 

lie Leu 

550 

Gly Asp 
565 

Tyr Gly 
His Gin 



Asp Asn 
295 

Leu Arg 

Lys Tyr 

Trp Ala 

Asp Arg 
360 

Glu Ala 

375 

Arg Arg 

Phe Ala 

Gly Asn 

Arg Tyr 
440 

Lys Val 
455 

Glu Asn 
Asp val 
ser Glu 



Leu Ala 
520 

Tyr Arg 
535 

Arg Ala 



Gin Glu Asn Asp Tyr Ala Met lie 

265 270 , 

Arg Tyr Phe Thr Ala Ser Glu Val 

285 

Asp lie Arg lie Arg Met Arg Asp 
300 

Glu Met Leu Ala Asp Pro Gin He 
315 320 

Ala Ala Ser Gin Asn Ala Tyr Lys 
330 335 

lie Lys Thr Arg Gly Leu Arg Gin 
345 350 

Leu lie Ala Trp Gly Ala Lys Gin 
365 

Val His Glu lie Asp Ala Thr val 
380 

Arg Tyr Trp Met lie Glii Glu Gly 
395 400 

Arg ser pro lie Pro Thr Glu Asp 
410 415 

Asp Ala Ser Ala Arg Lys Glu Ala 
425 430 

ser Lys Phe Ala Asn Lys Asp Tyr 
445 

Ala Val Ala Met Leu Thr Glu Tyr 
460 

Leu Pro Leu His Leu Arg Leu Val 
475 480 

Gin Ala Tyr Val Asp Asp lie Phe 
490 495 

Ala Gin Phe Asp Ala Phe Ala Ala 
505 510 

Glu Asp Pro Met val Leu Phe Ala 
525 

Lys Leu Tyr Asn Glu Leu Arg Pro 
540 

Gin Arg Thr Tyr lie Ala Gly Leu 
555 560 



Gin Asp 
Gin val 

Thr Thr Leu Asp Gly val Met Glu Lys Glu 



Gin Phe pro Asp Ala Asn Leu Thr 
570 575 

Lys Gly Tyr ser Pro Arg Asp Asn 
585 590 
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595 



600 



605 



Asp Pro Asp Asn Trp 
610 



Glu Phe Val Val Asp Pro Lys Leu Lys Ala Val 
615 620 



Tyr Glu Arg Lys Asp Phe Gly Arg Tyr Ala Asp Arg Ser Gly Arg Met 
625 630 635 640 

pro val Ala Phe Cys Ala Thr Thr His Thr Thr Gly Gly Asn Ser Gly 
645 650 655 

ser pro Val Met Asn Ala Asn Gly Glu Leu lie Gly Leu Asn phe Asp 
660 665 670 

Arg Asn Trp Glu Gly Val Gly Gly Asp He Gin Tyr Leu Ala Asp Tyr 
675 680 685 

Gin Arg Ser lie lie Val Asp lie Arg Tyr Val Leu Leu Val lie Asp 
690 695 700 

Lys Val Gly Gly cys Gin Arg Leu Leu Asp Glu Met Asn lie Val Pro 
705 710 715 720 

<210> 10 

<213> 9 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic peptide 
<400> 10 

Trp Ala Gly Gly Asp Ala ser Gly Glu 

<210> 11 

<211> 9 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic peptide 
<400> 11 

lie Ala Arg Arg His Pro Tyr Phe Leu 

<210> 12 
<211> 9 
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<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic peptide 

<400> 12 

Lys lie Ala Gly Tyr His Leu Glu Leu 
1 5 

<210> 13 

<211> 9 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic peptide 

<400> 13 

Phe Leu Arg Glu Pro val lie Phe Leu 
1 5 

<210> 14 

<211> 13 

<212> PRT 

<213> Artificial 

<220> 

<223> synthetic peptide 

<400> 14 

Trp His Trp Leu Glu Leu Lys Pro Gly Glu pro Met Tyr 
1 5 10 

<210> 15 

<211> 8 

<212> PRT 

<213> Artificial 



19/22 





WO 02/38742 



PCT/US01/46782 



<220> 



<223> Synthetic peptide 



<400> 15 



Ser Pro Tyr ser Ser Glu Thr Thr 
1 5 



<210> 16 

<211> 6 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic peptide 
<400> 16 

Ala Pro Val Arg ser Leu 
1 5 

<210> 17 

<211> 9 - 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic peptide 
<400> 17 

Gin Lys Gin Met Ser Asp Arg Arg Glu 

<210> 18 
<211> 26 
<212> PRT 

<213> Porphyromonas gingival is 
<400> 18 

Ala Asp Lys Gly wet Met Trp Leu Leu Asn Glu Leu Asn Gin Glu Asn 
1^5 10 15 

Leu Asp Arg Met Arg Glu Leu Gly Phe Thr 



20 



25 
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<210> 19 

<211> 6 

<212> PRT 

<213> Porphyromonas gingival is 

<400> 19 

Asp Asn Lys Pro Tyr Lys 

<210> 20 

<211> 5 

<212> PRT 

<213> Porphyromonas gingival is 

<400> 20 

Glu Met Thr Tyr Leu 
1 5 

<210> 21 

<211> 6 

<212> PRT 

<213> Porphyromonas gingival is 

<400> 21 

Phe Ala Gin Phe Ala Asn 
1 5 

<210> 22 

<211> 6 

<212> PRT 

<213> Porphyromonas gingival is 

<400> 22 

Val Leu Pro Ala Met Leu 
1 5 

<210> 23 

<211> 5 
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<212> PRT 

<213> Porphyromonas gingival is 
<400> 23 

Ser val Val Pro Tyr 
1 5 

<210> 24 

<211> 6 

<212> PRT 

<213> Porphyromonas gingival is 

<400> 24 

Leu Phe Phe Ala Gly Leu 
1 5 

<210> 25 

<211> 10 

<212> PRT 

<213> Artificial 

<220> 

<223> consensus sequence 
<400> 25 

Thr Gly Gly Asn Ser Gly Ser Pro Val Phe 
15 10 

<210> 26 

<211> 9 

<212> PRT 

<213> Artificial 

<220> 

<223> Consensus sequence 
<400> 26 

Thr Gly Gly Asn ser Gly Ser Pro Val 
1 5 
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